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Rise of the citizen scientist 


From the oceans to the soil, technology is changing the part that amateurs can play in research. 
But this greater involvement raises concerns that must be addressed. 


holiday this summer? Share the temperature data from your dive 

computer with researchers eager to plug holes in sparse records for 
inshore areas. Nervous about possible pollution from a nearby fracking 
project? Ease your concerns by helping to collect and analyse air samples 
as part of a monitoring project. Stuck at home as the rain pours down? 
Log on to the Internet and spend a couple of hours folding proteins and 
RNA to help university scientists work out how biology does it. 

Citizen science has come a long way from the first distributed- 
computing projects that hoovered up spare processing power on home 
computers to perform calculations or search for alien signals. And it 
has progressed further still since the earliest public surveys of wildlife: 
it was way back in 1900 that the Audubon Society persuaded Ameri- 
cans to exchange their Christmas tradition of shooting birds for a more 
productive effort to count them instead. 

Some professional scientists are sniffy about the role of amateurs, but 
as an increasing number of academic papers makes clear, the results can 
be valuable and can help both to generate data and to inform policy. 

A paper in Geoderma entitled ‘Can citizen science assist digital soil 
mapping?’ (D. G. Rossiter et al. Geoderma 259-260, 71-80; 2015) makes 
the case that, yes, non-specialists can help expert soil scientists to track 
quality, properties and types of soil. It goes further: these amateur soil 
researchers should be recruited to help with existing and future national 
surveys. Civil engineers and construction workers routinely view the 
subsoil, and digging foundations for buildings and trenches for pipelines 
offers a unique look at the spatial variability of different layers. An army 
of geocachers — twenty-first-century treasure hunters — visit harsh ter- 
rain and difficult-to-access places, and could collect soil data. And they 
routinely use satellite navigation to record their journeys. 


S cience is not just for scientists these days. Going on a scuba-diving 


Technology can make scientists of us all. Data churned out by the 
rapid spread of consumer gadgets equipped with satellite navigation, 
cameras and a suite of other sensors, and the ease of sharing the results 
digitally, are driving the boom in citizen science. Volunteers can already 
identify whale songs from recordings, report litter and invasive species, 
and send in the skeletons of fish they have caught and consumed. But 
there is more to being a scientist, of course, than collecting and sharing 
data — especially if the results are to be used to help determine policy. 

Critics have raised concerns about data quality, and some studies 
do find that volunteers are less able to identify plant species than are 
academics and land managers. And there are issues around how to 
reward and recognize the contribution of volunteers, and around 
ensuring that data are shared or kept confidential as appropriate. But 
these problems seem relatively simple to address — not least because 
they reflect points — from authorship to data quality and access — 
that the professional scientific community is already wrestling with. 

More troubling, perhaps, is the potential for conflicts of interest. 
One reason that some citizen scientists volunteer is to advance their 
political objectives. Opponents of fracking, for example, might help 
to track possible pollution because they want to gather evidence of 
harmful effects. When Australian scientists asked people who had 
volunteered to monitor koala populations how the animals should be 
managed, they found that the citizen scientists had strong views on 
protection that did not reflect broader public opinion. 

Scientists and funders are right to encourage the shift from passive 
citizen science — number crunching — to more-active roles, including 
sample collection. But as increased scrutiny falls on the reliability of 
the work of professional scientists, full transparency about the motives 
and ambitions of amateurs is essential. m 


Portfolio boost 


People who fund or manage many research 
projects could doit better with mindful analysis. 


range of research projects, or a university wanting to understand the 
broader benefits of its research. With a clear view of your institu- 
tion's overarching mission, you seek good researchers. You might then 
conclude: as long as they deliver impactful research, your job is done. 
But you could benefit from standing back and analysing the situa- 
tion through different lenses. For example, you could check whether 
research projects display undesirable uniformity in their underlying 
assumptions. You could enable collaboration between research strands 


[== that you are in charge of a foundation funding a substantial 


that happen to be converging on a particular social or technological 
outcome. Or you might find ways to diversify, maximizing the chances 
that something valuable (such as a diagnostic) will emerge even if the 
central goal (such as a vaccine) turns out not to be achievable. 

Such are the virtues of research-portfolio analysis. Those intrigued 
by such mindfulness should turn to a study (M. L. Wallace and I. Rafols 
Minerva 53, 89-115; 2015) that looks at relevant literature from recent 
decades, and highlights the benefits of inward-looking analysis that 
also creates opportunities for transparency and stakeholder engage- 
ment. The study highlights the limitations of corporate approaches to 
portfolio analysis that are geared to financial returns and well-defined 
markets — in contrast to the broader aims of government, philan- 
thropic funders and universities. 

In analysing how projects and themes can be mapped in ways that 
highlight “cognitive proximity’, this work may help those who run 
research portfolios to get better bangs for their bucks. It might even 
help to make the world a better place. = 
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is very welcome. But, as we in Nepal know only too well, the 

development of a vaccine is no guarantee that it will be used. 
Back in 1987, a successful trial of a typhoid vaccine here in Kathmandu 
gave the world a new and highly effective way to protect against one 
of its oldest killers. Yet, ironically, while the tourists and trekkers who 
have flocked in recent decades to Nepal — one of the countries where 
typhoid is still endemic — use the vaccine to protect themselves, most 
local people are denied it. 

The Nepalese government cannot afford the US$10 doses of vaccine 
that could protect millions of its people. But the need is great — par- 
ticularly since the devastating magnitude-7.8 earthquake that rocked 
Nepal in April. In many places, whatever water and sewerage systems 
existed were smashed, and thousands of peo- 
ple still live in temporary shelters and camps, 
increasing their risk of consuming contaminated 
water. In recent months, the monsoon (when 
cases of typhoid typically peak) has made a bad 
situation worse, and there are sporadic reports 
of disease outbreaks. These could yet provoke an 
epidemic in the hardest-hit places, such as the 
Sindhupalchowk and Gorkha districts. 

As typhoid spreads, the risks increase. The most 
severely affected are school-age children, who are 
struck down with a fever and abdominal pain for 
several weeks. Most of these children recover, but 
many develop complications, including hypo- 
tensive shock, perforation of the gut and gastro- 
intestinal haemorrhage. Antibiotic treatment has 
reduced the fatality rate from historic highs of 20% 
to 1-4%, but studies suggest that some typhoid strains are becoming 
resistant. Antibiotics are sold freely over the counter in Nepal and so are 
widely misused. About one in three people who report to a Kathmandu 
doctor with suspected typhoid will already have (mis)used an antibiotic. 

The aftermath of the earthquake has made many poor people even 
poorer. With no health insurance, the average Nepali family now has 
difficulty paying for an ill child to go to hospital. As a result, those 
children who develop typhoid complications are less likely to recover. 

Prevention is key. The World Health Organization (WHO) said 
as much in 2008, when it recommended programmes to vaccinate 
school-age children against typhoid. A year later, the WHO South-East 
Asia Regional Office recommended that such programmes be prior- 
itized for “immediate” implementation. Yet, despite several promises 
and false starts, this has not happened in Nepal. 

The best way to prevent typhoid is to provide 


r Vhe news that a promising vaccine has been tested against Ebola 
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WHILE TOURISTS AND 
TREKKERS USE THE 
VACCINE TO 


PROTECT 
THEMSELVES, 


MOST LOCAL PEOPLE 
ARE DENIED IT. 


Tackle Nepal’s typhoid 
problem now 


As post -earthquake conditions increase the risk of a typhoid epidemic, 
Buddha Basnyat calls for a widespread vaccination programme. 


meantime, wide deployment of a vaccine could be a useful — and 
lifesaving — measure. As part of the post-earthquake recovery plan, 
Nepal should kick-start a long-promised vaccination programme. 

Part of the problem is that the typhoid vaccine recommended by 
the WHO — the one that was tested in Kathmandu — is among the 
most expensive. To buy and deliver enough doses, Nepal will almost 
certainly need financial support. This could come from Gavi, the Vac- 
cine Alliance, based in Geneva, Switzerland, which is committed to 
increasing access to vaccination in poor countries, or from the profits 
of pharmaceutical companies. (Incidentally, another useful vaccine, 
against hepatitis E — a disease with a similar mode of transmission 
that can prove fatal to pregnant women — was also proven in Kath- 
mandu. It, too, is unavailable to local people.) 

At present, just one typhoid vaccine is approved 
by the WHO. It would be cheaper for Nepali 
health officials to deploy an alternative vaccine, 
not pre-qualified by the WHO. It is unclear why 
they have not yet done so; perhaps they are reluc- 
tant to be seen to be going against WHO guidance. 

The decision to vaccinate — whichever 
vaccine is used — must be backed with solid 
research. Here, there is some progress to report. 
Early next year, a multi-site project led by 
researchers at the University of Oxford, UK, will 
start to map the burden of typhoid fever across 
Asia and Africa. The objective of this study is to 
quantify accurately the number of people with 
typhoid fever and complications of the disease in 
countries such as Nepal. This information will be 
useful for vaccination campaigns. 

There are other pressing research needs, too. With no new treat- 
ments in the pipeline, it is important to track resistance to antibiot- 
ics. This can be done with proper microbiological support, including, 
where possible, genetic studies of the typhoid isolates. 

Researchers must also study why some people are more susceptible 
to the disease more than others. Last year, a preliminary genome-wide 
association study of people from Nepal and Vietnam implicated the 
HLA-DRB1 gene asa major contributor to resistance against typhoid 
fever (S. J. Dunstan et al. Nature Genet. 46, 1333-1336; 2014). 

If the Ebola vaccine is as effective as it seems to be, the world will 
have developed a way to tame this awful disease within 40 years of 
its first appearance. Reports of typhoid date back almost 2,000 years, 
but the disease is still killing people. It is not just the new threats that 
deserve our attention. m 


Buddha Basnyat practises medicine in Kathmandu, where he is 
director of the Oxford University Clinical Research Unit at Patan 
Hospital. 

e-mail: buddha. basnyat@ndm.ox.ac.uk 
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Selections from the 
scientific literature 


RESEARCH HIGHLIGHTS 


How yeast gota 
second genome 


The origin of the duplicate 
genome in baker’s yeast has 
been revealed: it arose when 
cells of two different species 
fused early in yeast’s evolution. 

Whole-genome duplication 
has happened during the 
evolution of many organisms, 
including yeast — in which 
researchers had thought it 
was a spontaneous event. 
Marina Marcet-Houben and 
Toni Gabaldon at the Centre 
for Genomic Regulation in 
Barcelona, Spain, analysed the 
genomes of Saccharomyces 
cerevisiae to reconstruct the 
evolutionary history of each 
of the yeast’s genes. They 
concluded that each of the two 
subgenomes originated from 
a different lineage. Over time, 
there were extensive genomic 
rearrangements and gene loss. 

This hybridization could 
have given yeast an advantage 
by bringing together different 
physiological properties. 
PLoS Biol. 13, e1002220 
(2015) 


ASTRONOMY 


Direct look ata 
small exoplanet 


Astronomers have snapped a 
picture of a planet like those in 
the Solar System but orbiting 
another star — the lowest-mass 
exoplanet ever directly imaged. 


GLACIOLOGY 


Asian glaciers make a hasty retreat 


The Tien Shan mountain range of Central Asia 
(pictured) lost more than one-quarter of its 
glacier mass in the past half-century, and may 


lose a further 50% by 2050. 


Daniel Farinotti of the GFZ German Research 
Centre for Geosciences in Potsdam and his 
colleagues used satellite data and glacier models 
to reconstruct changes in glacier mass and 
area over the past 50 years, extending previous 
decadal records. They found that glacier mass 


Bruce Macintosh of Stanford 
University in California and 
his team used the Gemini 
Planet Imager at the Gemini 
South Telescope in Chile 
to photograph the planet 
orbiting 51 Eridani, a young 
star 30 parsecs from Earth. 

The planet (artist’s impression 
pictured) is roughly twice the 
mass of Jupiter and farther away 
from its star than Saturn is from 
the Sun, but is comparatively 
closer than Uranus. 

Spectroscopic analysis of 
light from the planet revealed 
strong hints of methane in its 
atmosphere — similar to that 
found in the giant planets of 
the Solar System. 

Science http://doi.org/6s6 (2015) 
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fell by more than 25% from 1961 to 2012 — four 
times the global average — and glacier area 
shrank by about 3,000 square kilometres. 


As the climate warms, regional summer 


(2015) 


BIOTECHNOLOGY 


Opiates made in 
yeast from scratch 


Engineered strains of 
laboratory yeast can make 
small quantities of opiates that 
are used in pain medications, 
such as hydrocodone. 

Poppy farms are currently 
the main source of opiates, 
but production is uncertain 
because of weather, pests 
and other factors. Christina 
Smolke of Stanford University 
in California and her 
colleagues introduced more 
than 20 enzymes into a yeast 
strain, including some from 
poppy plants. After 120 hours 


melt water will probably decrease, putting the 
water supply at risk in Kyrgyzstan, Kazakhstan, 
Uzbekistan and parts of China where 
populations are rising, the authors say. 

Nature Geosci. http://dx.doi.org/10.1038/ngeo2513 


of growth using simple sugars, 
the yeast produced thebaine 
— which drug makers convert 
to codeine and other drugs — 
and hydrocodone. 

If the yields could be 
increased by more than 
100,000 times, such microbes 
could provide an alternative 
opiate source, the authors say. 
Science http://doi.org/6s8 (2015) 


Exercise hormone 
is no myth 


A hormone associated with the 
beneficial effects of exercise 
has been found in the blood of 
exercising humans, this time 


EVGENY DUBINCHUK/ALAMY 


DANIELLE FUTSELAAR/FRANCK MARCHIS/SETI INST. 


MARK NEWMAN/FLPA/IMAGEBROKER/REX 


thanks to more robust methods 
than those used in past studies. 

Bruce Spiegelman and 
Steven Gygi of Harvard 
Medical School in Boston, 
Massachusetts, and their 
colleagues reported in 2012 
that exercise boosted levels of 
the hormone irisin in the blood 
of mice and humans. But other 
researchers branded the finding 
a myth, saying that the human 
gene for irisin cannot produce 
a protein and that the tests 
used to detect the hormone 
were flawed. The team has now 
repeated the experiment with 
the more precise technique of 
tandem mass spectrometry, 
which confirmed the original 
findings. 

The technique is expensive 
and slow, but could be used to 
validate higher-throughput 
tests for irisin as they are 
developed, the authors say. 

Cell Metab. http://doi.org/6s9 
(2015) 


Visual brain area 
active during sleep 


Human brain regions involved 
in visual awareness are active 
during certain periods of sleep, 
as if the people were awake 
and seeing images. 

Yuval Nir at Tel Aviv 
University, Israel, and his 
colleagues monitored brain 
activity in people with epilepsy 
who had probes implanted as 
part of their surgical treatment. 
The researchers gathered data 
from the brain region involved 
in visual awareness as the 
people slept, viewed familiar 
images on a computer screen 
or lay awake ina dark room. 
They found that brain-activity 
patterns during rapid eye 
movement (REM) sleep were 
similar to those that occurred 
during image viewing — but 
not to the brain patterns 
recorded when participants 
moved their open eyes while 
lying in darkness. 

The eye movements in REM 
sleep could be directly linked to 
dreaming rather than just being 
an indication of a generally 
aroused brain, the authors say. 
Nature Commun. 6, 7884 (2015) 


How malaria 
raises cancer risk 


The malaria parasite damages 
DNA in mouse white blood 
cells, increasing the chances 

of them becoming cancerous. 
This could explain why 
Burkitt's lymphoma, a cancer 
of mature B cells, is common in 
areas where malaria is endemic. 

Davide Robbiani and Michel 
Nussenzweig at the Rockefeller 
University in New York and 
their colleagues infected 
mice with a malaria parasite 
(Plasmodium chabaudi). This 
caused certain B cells in the 
spleen to proliferate rapidly 
and to express high, sustained 
levels ofan enzyme called AID, 
which breaks DNA. 

Malaria alone did not cause 
cancer in the mice. However, in 
animals lacking the p53 tumour 
suppressor gene, malaria 
promoted a type of lymphoma 
marked by chromosomal 
rearrangements similar to those 
in human Burkitt's lymphoma. 
Cell 162, 727-737 (2015) 


Good cholesterol 
gets even better 


The molecule that removes 
cholesterol from arteries 
could have another protective 
effect in heart disease: curbing 
inflammation. 

High-density lipoprotein 
(HDL; known as the ‘good 
cholesterol’) transports fat 
from blood-vessel walls to 
the liver for excretion. But it 
also carries the fatty signalling 
molecule S1P, which activates 
the anti-inflammatory 
receptor S1P,. Timothy Hla 
of Cornell University in New 
York and his colleagues tested 
the effect of HDL-bound 
S1P on cells that line human 
blood vessels and found that 
it dampened inflammation. 

In mice engineered to develop 
heart disease, those missing 
the S1P gene had more arterial 
plaques than did those with 
the gene. 

The researchers think 
that this further explains the 


RESEARCH HIGHLIGHTS 


THIS WEEK 


SOCIAL SELECTION 


Popular topics 
on social media 


Trials register sees null results rise 


The launch of the clinicaltrials.gov registry in 2000 seems to 

have had a striking impact on reported trial results, according 

to a PLoS ONE study that many researchers have been talking 

about online in the past week. A 1997 US law mandated the 

creation of the registry, requiring researchers from 2000 

onwards to record their trial methods and outcome measures 

before collecting data. The study found that in a sample of 

55 large trials testing heart-disease treatments, 57% of those 

published before 2000 reported positive effects from the 

treatments. But that figure plunged to just 8% in studies that 

were conducted after 2000. Study author Veronica Irvin, 

a health scientist at Oregon State University in Corvallis, 

says this suggests that registering clinical studies is leading 

to more rigorous research. Writing on his NeuroLogica 

Blog, neurologist Steven Novella of Yale University in New 
Haven, Connecticut, called the study 


> NATURE.COM 
For more on 

popular papers: 
go.nature.com/nlurh8 


ability of HDL to stave off 
cardiovascular disease. 
Science Signal. 8, ra79 (2015) 


Beef farming 
brings on warming 


The most climate-friendly 
methods of beef production 
may not be enough to reduce 
the environmental effects of 
raising cattle. 

Raymond Pierrehumbert 
at the University of Oxford, 
UK, and Gidon Eshel at Bard 
College in Annandale-on- 
Hudson in New York modelled 
the warming effects of five 
different ways of producing 
beef, including intensive 
feedlot systems (pictured) and 


“encouraging 
because it casts doubt on previous 
positive results (go.nature.com/tybzvd). 
PLoS ONE 10, e0132382 (2015) 


> 


” but also “a bit frightening” 


pasture-based methods. Their 
analysis was based on the 
known amounts of greenhouse 
gases produced by each method. 
They found that certain 
forms of pastured beef 
production have a smaller 
climate footprint than feedlot 
systems. However, because 
the global population and its 
appetite for beef will only grow, 
the footprint of these methods 
is still sizeable enough that 
even the most efficient systems 
will probably not help climate 
warming to stay below 2°C. 
Environ. Res. Lett. 10,085002 
(2015) 
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SEVEN DAYS 


Access complaint 
US federal agencies, including 
the Food and Drug 
Administration and the 
Centers for Disease Control 
and Prevention, are continuing 
to restrict journalists’ access 
to information, according to 
the Society of Professional 
Journalists in Indianapolis, 
Indiana. In July 2014, the 
society complained that 

staff at the agencies were 
being prohibited from 
communicating with 
journalists and that interviews 
with federal employees were 
being monitored. A survey by 
the society in April this year 
found that science journalists 
are also affected by these 
practices. Ina letter sent to 

US President Barack Obama 
on 10 August, 53 journalism 
and open-government groups 
say that restrictions on access 
to federal information are 
dangerous and should be lifted. 


Greek bailout boost 


The Greek government will 
soon allow researchers to 
apply for research grants 

for the first time in several 
years, after the country’s 
third bailout in five years 

was approved on 14 August. 
The bailout — which must 
still be ratified by eurozone 
parliaments — contains two 
portions of money earmarked 
for research. These include 
€59 million (US$66 million) 
for proposals previously 
frozen in the crisis and 

€53 million for new calls and 
international projects. The 
debt crisis has left researchers 
reliant on international 
grants — some have had to ask 
collaborators abroad for basic 
supplies. It is hoped that the 
funding will help to address 
Greece’ brain drain. See page 
273 for more. 


The news in brief 


Chinese chemicals blast hits port city 


China’s prime minister, Li Keqiang, has 
promised a thorough investigation into 
explosions that occurred on 12 August at 

a chemicals warehouse in the port city of 
Tianjin. The blasts killed more than 100 people, 
with dozens still missing; most of the dead 

and missing are firefighters. Local officials 
confirmed that highly toxic sodium cyanide 
was present at the site, although they said that it 


Muslim climate call 
Islamic leaders have told the 
world’s 1.6 billion Muslims 
that it is their religious duty to 
meet the challenge of climate 
change. The declaration, 
made on 18 August at the 
International Islamic Climate 
Change Conference in 
Istanbul, Turkey, called for 
climate education in mosques 
and Islamic schools, and 
appealed to rich, oil-producing 
nations to lead emissions 
cuts. The announcement was 
accompanied by messages 

of solidarity from other 

faiths. Some observers are 
cautiously optimistic about 

its impact on the behaviour of 
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individuals, but more sceptical 
about its ability to influence 
government policy. 


Record El Nifio 


An El Nifio event developing in 
the eastern Pacific Ocean could 
be one of the strongest on 
record, forecasters with the US 
National Oceanographic and 
Atmospheric Administration 
announced on 13 August. The 
periodic weather pattern — 
signalled by warming waters 
and weakening trade winds — 
often leads to wetter winters in 
the southern United States and 
dry conditions in Australia, 
Indonesia and India. In 
California, where drought has 
decimated state water supplies, 
the rains would be welcome, 


would be neutralized. The blasts also damaged 
the building that houses China’s supercomputer, 
Tianhe-1, which has been used extensively in 
the country’s space programme. The computer 
was running smoothly after the explosions, 

but was shut down for security reasons. 

Local media aired allegations that regulatory 
impropriety allowed the warehouse to be sited 
too close to residential and business areas. 


but forecasters say that one 
strong El Nino will probably 
not slake the state's thirst. 
See go.nature.com/tjgdhz 
for more. 


Fossil-fuel plea 
Almost 1,000 health 
professionals and academics 
have signed a letter asking 
two of the world’s biggest 
medical charities to sell off 
investments in fossil-fuel 
companies. The letter, released 
on 14 August, calls on the Bill 
& Melinda Gates Foundation 
in Seattle, Washington, and 
the London-based Wellcome 
Trust to divest their shares in 
the world’s 200 largest fossil- 
fuel companies within 5 years. 
According to 2014 financial 


YUE YUEWEI/XINHUA PRESS/CORBIS 


= documents, the Wellcome’s 
£18-billion (US$28-billion) 
endowment included 
investments totalling more 
than £500 million in several 
such companies. Its director, 
Jeremy Farrar, has previously 
defended these investments, 
arguing that they give the 
charity leverage to influence 
the companies’ practices. 


} RESEARCH 
Gamer danger 


Exposure to violent video 
games is linked to aggressive 
behaviour in older children, 
adolescents and young adults, 
according to a report issued by 
the American Psychological 
Association (APA) on 

13 August. The organization 
reached its conclusion on the 
basis of 31 studies published 
in 2009-13. It could not, 
however, determine the 
relationship between video 
games and criminal behaviour 
or delinquency. In response 

to the findings, the APA has 
adopted a resolution that calls 
for a refined rating system for 
video games to better reflect 
violent content. 


LEE THOMAS/ALA\ 


Ice delays science 
The Canadian Coast Guard's 
Amundsen icebreaker 
(pictured), which is being 
used for the Canadian part 
of an Arctic oceanographic 
programme, is due to end the 
first leg of a disrupted cruise 


TREND WATCH 


SOURCE: NPG 


Scientists’ concerns about open- 
access (OA) publishing might be 


fading, according to the results of 


a large Nature Publishing Group 
survey released on 13 August 
(see go.nature.com/t2cxrq). 
The poll found that more than 
one-third of scientists have not 
published any OA papers in the 
past three years, a proportion 
that has shifted little since 2014. 
But last year, 40% of those 
researchers expressed concerns 
about perceptions of the quality 


of OA publications; this year, that 


proportion dropped to 27%. 


this week. The research trip 
is part of the international 
GEOTRACES programme 
studying pan-Arctic Ocean 
geochemistry. Soon after the 
Amundsen left Quebec City 
on 10 July, the Coast Guard 
diverted it to break ice for 
commercial vessels in Hudson 
Bay, despite objections from 
the cruise’s chief scientist, 
Roger Francois of the 
University of British Columbia 
in Vancouver. Although it 
lost two weeks, the cruise was 
able to salvage nearly all of 

its science, but Francois says 
that the decision underscores 
“systemic inadequacies” in 
Canadian oceanographic 
research. 


Pollution deaths 
Air pollution in China is 
responsible for nearly 17% 
of the country’s deaths 

each year, according toa 
study by Berkeley Earth in 
California, a climate-science 


non-profit organization (see 
go.nature.com/svbild). The 
report analysed 4 months of 
data from more than 1,500 
pollution-monitoring sites 
around China. It found that, on 
average, 38% of the population 
breathes air deemed unhealthy 
by US Environmental 
Protection Agency standards. 
The estimate that air pollution 
kills 1.6 million people in 
China annually is an increase 
of 400,000 over previous 
estimates. 


Insect investment 


Synthetic-biology firm 
Intrexon announced on 

10 August that it will 

buy Oxitec, a company 

in Abingdon, UK, that is 
developing genetically 
modified insects. The deal 

is worth US$160 million. 
Oxitec creates sterile Aedes 
aegypti mosquitoes, which are 


OPEN-ACCESS PERCEPTIONS CHANGING 


Survey results reveal how scientists’ attitudes towards 
open-access (OA) publishing have shifted in the past 12 months. 


| am concerned about perceptions 
of the quality of OA publications 


lam not willing to pay an 
APC* to publish an article 


| was unable to 
fund an APC 


No OA option for the journal 
| wanted to publish in 


| believe that self-archiving after 
an embargo period is sufficient 


| was not aware of OA 
as a publishing model 


*Article processing charge 
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SEVEN DAYS | THIS WEEK | 


23-27 AUGUST 
Ecologists gather 

in Manchester, UK, 

to discuss resilient 
ecosystems at the 6th 
World Conference on 
Ecological Restoration. 
www.ser2015.0rg 


23-28 AUGUST 

Eger, Hungary, hosts the 
world’s microscopists as 
they gather for the 12th 
Multinational Congress 
on Microscopy. 
www.mem2015.com 


24-28 AUGUST 
The 10th Annual 
Immunotherapy 
and Vaccine Summit 
is held in Boston, 
Massachusetts. 
www.imvacs.com 


released in Brazil to reduce 
the size of the mosquito 
population and slow the 
spread of dengue. Intrexon, 

in Germantown, Maryland, 
has also invested heavily in 
AquaBounty, a biotechnology 
company in Maynard, 
Massachusetts, that makes 
genetically modified salmon. 


Genome venture 
Genome editing continues 

to attract heavy interest from 
investors. Editas Medicine, 
which was founded in 

2013 to develop therapies 
using the gene-editing 
technique known as CRISPR, 
announced on 10 August that 
it had raised US$120 million. 
Investors in the Cambridge, 
Massachusetts, company 
include inventor and 
philanthropist Bill Gates and 
Google Ventures, the venture- 
capital arm of the technology 
giant in Mountain View, 
California. The versatility of 
CRISPR has fuelled optimism 
about its potential applications 
in agriculture and medicine. 


> NATURE.COM 
For daily news updates see: 
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Greek Prime Minister Alexis Tsipras addresses parliament during the vote on a bailout deal in Athens on 14 August. 


ECONOMIC CRISIS 


Greek bailout set to free 
up research funds 


Research minister says two chunks of money from the European Union can now be released. 


BY ALISON ABBOTT 


hen the banks closed in Greece at 
the start of July, Vassilis Gorgoulis, 
a cancer researcher at the Univer- 


sity of Athens, was poised to start experiments 
that journal referees had requested for a high- 
profile paper. To his horror, he was unable to 
buy the items he needed because money could 
no longer be sent out of the country. “I was 
lucky,’ he says. “My collaborators at the Uni- 
versity of Manchester ordered them for me and 
sent me the kits I needed.” 

Greek scientists who do not have collabora- 
tors abroad are finding it even harder to carry 
on doing experiments. The constraints on the 
flow of capital come on top of massive budget 
and salary cuts in the five years since Greece's 
debt crisis began. And they are accompanied 


by a stultifying increase in bureaucracy 
because all public spending in the country has 
come under intense scrutiny (see Nature 517, 
127-128; 2015). 

On 14 August, the Greek Parliament and the 
European Council of Ministers both approved 
the third bailout of Greece's current economic 
crisis. That, along with slight improvements in 
the country’s financial liquidity, clears the way 
for the release of two large chunks of research 
money, says Costas Fotakis, the Greek research 
minister. One of them will go towards calls 
for research proposals to be put out in Octo- 
ber — the first major calls for several years. 

“This is very encouraging indeed,” says 
Gorgoulis. Without these calls, researchers 
have had to rely on money from international 
grants. Moreover, Fotakis, a prominent laser 
physicist who was appointed to his post in 


January when the left-wing Syriza party took 
power (see Nature http://doi.org/6s4; 2015), 
says that he has plans to reinvigorate research, 
including creating Greece's first fund dedicated 
to research and innovation. 

However, unhappiness with the deal within 
the Syriza party may lead to snap elections and 
introduce more uncertainties. 

Both chunks of cash come from European 
Union (EU) ‘structural-fund’ packages — sub- 
sidies for poor regions that can be used for 
research. One chunk, amounting to €59 million 
(US$66 million), had not been released from 
the structural-fund round that ended in 2014, 
says Fotakis. It will provide funding to back 
calls for proposals that had been frozen because 
of the crisis. The other, for €53 million, is part 
ofa package that will be rolled out over the next 
7 years. As well as funding new calls, the 


20 AUGUST 2015 | VOL 524 | NATURE | 273 


© 2015 Macmillan Publishers Limited. All rights reserved 


| NEWS IN FOCUS 


> second chunk of money will cover fees 
to international research organizations and 
pay for HEAL-Link — a government-paid 
portal that provides scientists with access to 
electronic journals. The portal was closed 
down on 1 July owing toa lack of funds (see 
Nature http://doi.org/6s5; 2015). 

Details of the upcoming calls have not 
been released, but Fotakis says that they 
will focus on proposals that are designed to 
address immediate priorities, such as slow- 
ing or reversing a crippling brain drain and 
making the research environment more 
attractive. But even more important, he 
says, are his plans for the “day after the cri- 
sis”. “We are designing policies for a new 
scientific landscape ready for when things 
start to normalize here,” he says. 

One planned task is to present Parlia- 
ment with a law designed by his ministry 
to make research more efficient by remov- 
ing bureaucratic obstacles that emerged as a 
result of austerity. Those include checks on 
spending insisted on by the organizations 
that represent Greece’s creditors — even 
ordering stationery is now a complicated 
procedure. “This law is ready to go and will 
be put before Parliament just as soon as the 
agreement between Greece and its creditors 
is certain,’ Fotakis says. 

Panayiota Poirazi, a computational biolo- 
gist at the Institute for Molecular Biology 
and Biotechnology in Heraklion, knows 
only too well that researchers face greater 
problems than money. She has a €1.5-mil- 
lion ‘starting grant’ from the European 
Research Council but has not always been 
able to spend it because of the capital con- 
trols. “When the banks closed we could 
not replace an air-conditioning unit in our 
computer server room,’ she says. 

The research and innovation fund that 
Fotakis plans to create would be fed by pub- 
lic and private sources and foster basic and 
applied research, Fotakis says. Science will 
help to promote a “knowledge-based econ- 
omy’, he says, “but also curiosity-driven 
research is needed to make an impact on 
the economy in the long term”. 

It might be difficult to find much private 
investment in Greece right now, he acknow1- 
edges, but he is negotiating for support from 
various sources. One of those is the Euro- 
pean Fund for Strategic Investment, which 
was created earlier this year to stimulate 
investment in high-risk, high-return inno- 
vations (see Nature http://doi.org/6s7; 2015). 

Gorgoulis likes the sound of Fotakis’s 
plans: “This is the way to go if we want to 
have a long-term perspective, if Greece is to 
have a future.’ But some researchers worry 
that the new uncertainties hanging over the 
current government could end up delaying, 
or derailing, the research plans. m 


Additional reporting by Anastasia Christakou. 
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INFECTIOUS DISEASE 


facility to BSL-4 status. 
BY HELEN SHEN 


decision to upgrade an existing infectious- 

disease lab to handle the most-hazardous 
pathogens. The move sweeps away more 
than three decades of political opposition to 
operating a top-biosafety-level facility 30 kil- 
ometres west of Tokyo in the city of Musashi- 
Murayama. 

An agreement reached on 3 August between 
Japan’s health ministry and the mayor of 
Musashi-Murayama clears the way for the 
facility to begin limited work with pathogens 
such as the Lassa and 


Je is set to join an elite club with its 


Ebola viruses. Japan's “The most 
National Institute likely impact 
of Infectious Dis- '§ that this will 
eases (NIID) built influence more 
the biosafety-level-4 nations tojoin 


(BSL-4) lab in 1981, this club.” 
but it has been lim- 
ited to operating as a BSL-3 lab because of 
community safety concerns. Fears that Ebola 
might reach Japan during last year’s outbreak 
in West Africa partly motivated the policy 
change. 

The deal sets several conditions for the lab’s 
activities: the NIID has committed to maintain 


2015 
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The number of nations with a top-level infectious-disease lab is set to grow as Japan joins the fold. 


Japan to upgrade 
biolab security 


Fears about Ebola helped forge deal to elevate existing 


transparency in reporting lab operations and 
any accident, and the lab must restrict its 
BSL-4 work to diagnosing and treating patients 
instead of running a broader research pro- 
gramme. However, virologist Ayato Takada at 
Hokkaido University in Sapporo, Japan, hopes 
that the agreement will ease the way for other 
facilities where scientists can perform basic 
infectious-disease research at the BSL-4 level. 
Discussions are under way to build a bigger and 
more modern BSL-4 lab at Nagasaki University 
—amove that has similarly met with commu- 
nity opposition. 

For now, Takada’s studies of Ebola virus must 
be done in collaboration with international 
partners; he flies to the United States several 
times a year to perform BSL-4 experiments. 
“Tt’s time-consuming and expensive,” he says. 
“T really hope the decision at Murayama will 
have a good influence on the likelihood of the 
Nagasaki University BSL-4 plan” 

Upgrading operations from BSL-3 to full 
BSL-4 status may take several months, says 
virologist Masato Tashiro, former director of 
the NIID’s Influenza Virus Research Center. 

The original BSL-4 infrastructure at 
Musashi-Murayama has been maintained 
over the decades, but new protocols will need 
to be established and staff trained for the 
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higher-security-level work. The lab will also 
need to import samples from other countries 
to build a reference library for diagnosing 
suspected infections. 

Typical BSL-3 labs include two sets of 
self-locking doors and directional airflow 
to prevent the escape of potentially lethal, 
airborne pathogens. But BSL-4 labs have 
extra features that protect the workers and 
prevent the escape of highly lethal microbes 
that cause infections for which there are no 
treatments or vaccines. 


EBOLA RESPONSE 

Over the years, the NIID made various 
attempts to gain public support for BSL-4 
operations at the site, but some citizens and 
politicians were concerned that the risks 
outweighed the benefits. That began to 
change last year as the Ebola outbreak raged 
in West Africa. Japan, like many other coun- 
tries, worried that the virus could cross its 
borders through international travel. 

Japan joins a growing club of nations 
that have considered or expanded BSL-4 
capabilities over the past decade — many 
spurred by the 11 September 2001 terror- 
ist attacks. The Federation of American 
Scientists in Washington DC, a non-profit 
organization specializing in security mat- 
ters, estimated in 2011 that roughly 40 
BSL-4 labs existed or were under construc- 
tion worldwide, although some researchers 
consider that an overestimate. 

“It was odd that a global player in 
science and a highly developed industrial- 
ized country such as Japan has not had that 
type of facility. Its brilliant that they have 
one now,’ says virologist Paul Duprex at 
Boston University School of Medicine in 
Massachusetts. 

Others, however, argue that BSL-4 labs 
serve only a small slice of infectious-disease 
research, and that existing facilities already 
exceed global needs. “BSL-4 facilities are 
fantastically expensive to construct and 
operate, and that comes at the cost of other 
areas of biomedical research,’ says molecular 
biologist and biosecurity specialist Richard 
Ebright at Rutgers University in Piscataway, 
New Jersey. Although the Musashi-Muray- 
ama lab itself is limited in scope, Ebright 
says, “the most likely impact is that this will 
influence more nations to join this club”. = 
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Most gay scientists 
are out in the lab 


Female-dominated fields found to be more welcoming. 


BY MARLA BROADFOOT 


(LGBT) scientists feel more accepted in 

the workplace than their peers in other 
professions, a US survey suggests. The study, 
published in the Journal of Homosexuality, 
also found that respondents in scientific fields 
with a high proportion of women, such as the 
social sciences, were more likely to be out to 
their colleagues than those in male-intensive 
disciplines such as engineering (J. B. Yoder 
and A. Mattheis J. Homosexuality http://doi. 
org/6vc; 2015). 

That result may suggest that laboratories 
with more women tend to be more receptive to 
people who do not fit the stereotype of a scien- 
tist as a straight, white man, says Jeremy Yoder, 
an evolutionary ecologist at the University of 
British Columbia in Vancouver, Canada, and 
co-author of the study. 

The Queer in STEM survey gauged the expe- 
riences of more than 1,400 LGBT individuals 
working in science, technology, engineering 
and mathematics (STEM) fields. The 58-item 
online survey included questions about gen- 
der and sexual identity, professional exper- 
tise and whether respondents felt that their 
workand social communities were welcoming 
or hostile. 

Other studies have indicated that out employ- 
ees feel more accepted at work, whereas those 
who are closeted or not open about their iden- 
tities are more likely to be stressed at work and 
have negative attitudes towards their work 
and co-workers (K. H. Griffith and M. R. Hebl 
J. Appl. Psychol. 87, 1191-1199; 2002). 

Participants rated their openness about their 
gender and sexual identities on a scale from 
0 (“Lam not out to anyone in this group”) to 
5 (“As far as ’'m aware, everyone in this group 


Li gay, bisexual and transgender 


could know”). The results indicated that the 
majority of the respondents (57%) were out to 
half or more of their colleagues, a greater pro- 
portion than the 47% of people who said that 
they were out in a 2014 survey of the general 
US workforce by the Human Rights Campaign 
Foundation in Washington DC. 

“There is reason to believe this difference 
could be real,” says Yoder. “In STEM workplaces 
you are working with a fairly well-educated set 

of co-workers, and 


“You are | you may very well be 
working with able to expect a more 
afairly well- open culture? 

educated set of Most respondents 
co-workers,and identified them- 
you may very selves as lesbian, gay, 
well be able to bisexual, transgen- 
expect amore der, queer or asexual. 


Some ticked more 
than one box, and 
5% wrote down an identity not included in 
the survey’s list, such as polyamorous, non- 
monogamous or pansexual. The researchers 
relied on social media and networking to 
recruit respondents, so they were concerned 
about the representativeness of the sample. A 
geographic analysis found that respondents 
came from US Census Bureau regions roughly 
in proportion to their estimated LGBT popu- 
lations, but survey respondents tended to be 
young and to work in the life sciences. 

“The study breaks new ground in providing 
a national portrait,’ says Kristen Renn, a social 
scientist at Michigan State University in East 
Lansing who studies LGBT college students. 
She adds that the findings “provide a sense of 
possibility” to LGBT and queer youth who are 
exploring careers. “For those wondering, ‘Can 
I be an openly queer chemist?” she says, the 
answer is: “Yes. And you wont be alone.” = 


open culture.” 
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China’s cement industry may produce less carbon than was thought. 


China’s emissions 
overestimated 


Analysis could force climate scientists to revisit estimates of 


global carbon cycle. 


BY JEFF TOLLEFSON 


( "sini carbon emissions may be 
significantly lower than previously 
thought — about 14% less in 2013 than 

estimated by the Chinese government and 

others, according to research published this 

week in Nature (Z. Liu et al. Nature 524, 335- 

338; 2015). The analysis draws on data from 

more than 4,200 Chinese mines — including 

new measurements of the energy content of 
coal — among other sources. 

“At the beginning of the project we thought 
that the emissions might be higher” than exist- 
ing estimates, says Zhu Liu, an ecologist at 
Harvard University in Cambridge, Massachu- 
setts, and lead author of the study. “We were 
very surprised.” 

His team’s findings do not unseat China from 
its position as the world’s largest emitter of car- 
bon dioxide. Even when the lower estimate is 
taken into account, China’s carbon output for 
2013 is still more than two-thirds higher than 
that of the United States, the second-largest 
emitter. But the study underscores long- 
standing uncertainties in the methods with 


which scientists calculate the emissions of indi- 
vidual nations, and how much carbon cycles 
through the atmosphere and into oceans and 
ecosystems. For comparison, the cumulative 
reduction in Chinese emissions outlined in the 
study — roughly 2.9 billion tonnes from 2000 
to 2013 — is larger than the estimated amount 
of carbon that the world’s forests pulled out of 
the atmosphere from 1990 to 2007. 

That presents a problem for researchers 
who study the carbon cycle. “We can easily go 
back and retroactively adjust Chinese carbon- 
emission estimates,” says Ashley Ballantyne, a 
climate scientist at the University of Montana 
in Missoula. “Unfortunately, we cannot go back 
and adjust all the previous studies on the global 
carbon cycle and their conclusions based on the 
previously biased emission estimates.” 

The Chinese government releases data on 
energy consumption and production at the 
provincial and national levels, but those sta- 
tistics often conflict with each other, and are 
revised frequently. Liu and his team analysed 
government data on energy production and 
on exports and imports of coal, oil and gas. 
They found that China’s fossil-fuel use was 
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10% above the official government estimate, 
but that the country’s overall emissions were 
lower once China's reliance on low-quality 
coal from domestic mines was taken into 
account. This is because lower-quality coal 
contains less carbon than higher-quality 
deposits, so burning it produces less energy 
and less heat-trapping CO,. 

The team says that its estimate for how much 
CO, will be produced by burning Chinese coal 
is around 40% less per unit than the figures 
adopted by the Intergovernmental Panel on 
Climate Change. And the team calculates that 
emissions from cement production, a coal- 
fuelled process that is a major contributor 
to global emissions, are 45% below existing 
estimates. 

The coal measurements were collected 
from mine reports and from a project spon- 
sored by the Chinese Academy of Sciences 
that assesses the country’s cumulative carbon 
emissions and carbon uptake by ecosystems 
across China. Liu says that the quality of 
Chinese coal is likely to be getting worse as 
the country burns through its best reserves. 

“This is probably the best available estimate 
of emissions from coal burning in China, and 
that is an important contribution,’ says Gregg 
Marland, a geologist at Appalachian State 
University in Boone, North Carolina, and a 
co-author of the study. But he adds that the 
revised figure is within the range of uncer- 
tainty reported in existing inventories. 

And it may need to be increased as the Chi- 
nese government releases further energy data, 
says Glen Peters, a climate-policy researcher 
at the Center for International Climate and 
Environmental Research in Oslo. Although 
China said in February that its coal consump- 
tion had dropped between 2013 and 2014, he 
notes, the government has since increased its 
cumulative estimate of coal consumption over 
the past decade by 12-14%. Scientists are still 
waiting for the government to release revised 

estimates of energy 


“With Chinese production, including 
energy imports and exports, 
statistics, over the past decade. 
there is always Liu says that his team's 
hyp? ” estimates are unlikely to 

a ‘but’. 
change when the latest 


data are released later 
this year, but Peters says that the figures may 
need to rise by as much as 7%. 

Such uncertainty arises in part from the 
many different ways to define and measure 
energy consumption; researchers do not 
know what kind of assumptions the Chinese 
government is making with its data. “If China 
reported their CO, emissions, then we would 
know the assumptions that they want to make 
and many of these issues would then go away,’ 
says Peters. The challenge, he adds, is unlikely 
to be resolved anytime soon. 

“With Chinese energy statistics,” he says, 
“there is always a ‘but?’ m 
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Superconductor high 
spurs wave of physics 


Researchers strive to reproduce and understand zero- 
resistance material that works at a record high temperature. 


BY EDWIN CARTLIDGE 


ydrogen sulfide — the compound 
Hssenst for the smell of rotten 

eggs — conducts electricity without 
any resistance at a record high temperature of 
203 kelvin (—70°C), reports a paper published 
on 17 August in Nature’. 

The work is a step towards finding a long- 
sought room-temperature superconductor. 
The first results were released on the arXiv 
preprint server in December’; more followed 
in June’. Both have already sparked a wave of 
follow-up calculations and experiments, not to 
mention excitement. 

The discovery is “historic”, says physicist 
Fan Zhang of the University of Texas at Dal- 
las, adding that its impact will be “far rang- 
ing” A superconductor that works at room 
temperature could improve electricity gen- 
eration and transmission, as well as giving a 
boost to current uses of superconductivity, 
such as in the huge magnets used in medical 
imaging machines. In a News & Views article* 
that accompanies the Nature paper online, Igor 
Mazin of the Naval Research Laboratory in 
Washington DC describes the finding as “the 
holy grail of superconductors”. 

The results are the work of Mikhail Eremets, 
Alexander Drozdov and their colleagues at the 
Max Planck Institute for Chemistry in Mainz, 
Germany. They find that when they subject 
samples of hydrogen sulfide to extremely high 
pressures — around 1.5 million atmospheres 
(150 gigapascals) — and cool them below 
203 K, the samples display the classic hall- 
marks of superconductivity: zero electrical 
resistance and a phenomenon known as the 
Meissner effect, in which a material is placed in 
an external magnetic field but there is no field 
inside the sample, unlike in normal materials. 


DEFYING CONVENTION 

Other scientists are intensely interested in the 
result because it was achieved without using 
exotic materials such as copper-containing 
compounds called cuprates, says physicist 
Christoph Heil of the Graz University of Tech- 
nology in Austria. Until now, these have held 
the record for the highest superconducting 
temperature — 133 K (-140°C) at ambient 
pressure and 164 K (-109°C) at high pressure. 


Many scientists had assumed that super- 
conductivity at temperatures of more than a 
few tens of kelvins required such materials. 
But Heil says that the pressurized hydrogen 
sulfide seems to be a ‘conventional’ super- 
conductor — vibrations in the material’s 
crystal lattice drive electrons to form ‘Cooper 
pairs’ that can flow through the crystal with- 
out resistance. And in calculations reported 
in April? , Matteo Calandra of the Pierre and 
Marie Curie University in Paris and his col- 
leagues found that the results could indeed 
be explained using a modified version of 
the conventional theory of low-temperature 
superconductivity based on lattice vibrations. 

For others, such theoretical analyses are 
superfluous until the result by Eremets and 
co-workers is confirmed experimentally 
by independent teams. Several are working 
towards that goal, including Katsuya Shimizu 
of Osaka University in Japan and his colleagues, 
who have seen the loss of resistance in pressur- 
ized hydrogen sulfide, but have yet to observe 
the Meissner effect. Meanwhile, three Chinese 
groups and one US team contacted by Nature 
say that they have yet to confirm either the elec- 
trical or magnetic effects. 

If Eremets and colleagues are proved right, 
then other compounds that pair hydrogen 
with, for instance, platinum, potassium, sele- 
nium or tellurium instead of sulfur may also 
be good candidates for high-temperature 
superconductivity, according to several other 
research groups. Taking a different tack, Zhang 
in Dallas and Yugui Yao of the Beijing Institute 
of Technology in China predict® that substi- 
tuting 7.5% of the sulfur atoms in hydrogen 
sulfide with phosphorus and upping the pres- 
sure to 2.5 million atmospheres could raise the 
superconducting transition temperature all the 
way to 280 K, which is above the freezing point 
of water. = 
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A model of the alpine skiing area displayed during Beijing’s bid to host the 2022 Olympic Winter Games. 


CONSERVATION 


Chinese biologists 
lead Olympics outcry 


Proposed alpine skiing area lies within nature reserve. 


BY DAVID CYRANOSKI 


( “pido hos of Beijing’s success in its 
bid to host the 2022 Olympic Winter 
Games have been tempered by protests 

from young biologists over the proposed alpine 

ski site. They say it falls within a protected 
national nature reserve that contains many rare 
species, including Beijing’s only Shanxi orchids 

(Cypripedium shanxiense). 

More importantly, say other biologists, the 
proposed site would violate environmental 
protection laws touted by the government, and 
create a precedent that would hamper already 
fraught efforts to conserve other, more signifi- 
cant sites. 

Local officials have since said that they will 
redraw the lines that delineate the reserve — but 
this has left the protesters unsatisfied. Mean- 
while, their online posts on the subject, to the 
Chinese social media website Weibo, are no 


longer visible; the protesters say they have been 
blocked. 

On 31 July, the International Olympic 
Committee, based in Lausanne, Switzerland, 
announced that Beijing would host the 2022 
Winter Olympics, beating its only rival, Almaty 
in Kazakhastan, by 44 votes to 40. But nature 
enthusiasts quickly noted that the proposed 
alpine skiing routes fall within the core area of 
the 4,600-hectare Songshan National Nature 
Reserve. On 1 August, Wang Xi, who recently 
received his PhD in plant biology and works at 
the Chinese Academy of Sciences in Shanghai, 
overlaid maps from the International Olympic 
Committee's evaluation report with those from 
the reserve’s website and posted the result on 
his Weibo account: both the start and end of 
the alpine runs fall within the reserve, he found 
(see ‘Olympic overlap’). 

Xi told Nature’s news team that his main 
motivation was to spread news of the possible 
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ecological impact on plants there, including on 
three orchid species that are classified at the 
highest protection level under Beijing's con- 
servation system. “It’s a chance for the govern- 
ment to connect with the people and talk to 
each other to solve this problem,” he says. “I 
am not against the Olympic Games, but they 
should be carried out in an environmentally 


friendly way.” 


WEIBO CLICKS 

Xi’s post was clicked some 240,000 times and 
forwarded more than 1,000 times, but within 
two days he found that it no longer appeared 
online. Word spread anyway. Yun Ji, a 28-year- 
old entomologist in Beijing who did not want 
his affiliation mentioned, had copied Xi’s 
maps. He added information on and pictures 
of the various plants found in the reserve that 
are on a list of the rarest and most rigorously 
protected of Beijing's plants, along with photos 
of birds whose habitat, he says, could also be 
destroyed. His Weibo post is now inaccessi- 
ble from the site — he claims it disappeared 
within four hours — but while it was online he 
says that it was forwarded 3,000 times, and one 
of those reposts was forwarded an additional 
16,000 times. 

Neither the Chinese Olympic Committee 
nor the Beijing government has released offi- 
cial statements responding to the allegations 
in the posts; in the International Olympic 
Committee evaluation report, the skiing area 
is described as being “adjacent to” the reserve. 
But on 7 August, the deputy mayor of Yanqing, 
the county in Beijing that has jurisdiction 
over the Songshan National Nature Reserve, 
announced that the borders of the reserve 
would be “adjusted” to take in land to the 
northeast of the current boundaries, and that 
some of the original reserve will be taken “to 
provide the necessary space for local sustain- 
able development and to promote interaction 
between ecological protection and economic 
society”. 

The details are not yet clear, but the proposed 
reserve is 31% larger than the original and does 
not overlap with the ski slopes, according to 
interviews with the deputy mayor published 
by Chinese media 


“Tamnot against outlets. The mayor's 
the Olympic office deferred que- 
Games, but ries from Nature’s 
they should be news team to the bid 
carriedoutinan committee, but the 
environmentally team did not receive 


a response from that 
committee nor from 
the China Olympic Committee. The Inter- 
national Olympic Committee has also not 
replied to queries from Nature’s news team on 
how it handles applications that have incon- 
sistencies, and whether conflict with a nature 
reserve would have affected its decision. 
According to 2013 Chinese govern- 
ment regulations, those who wish to change 


friendly way.” 
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Chinese biologists have 
shown that the area 
proposed for the alpine skiing 
event in the 2022 Beijing 
Olympic Winter Games lies 
within the core boundary of a 
nature reserve. 


Proposed 
ski park 
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Mainland 
China 


Existing road 


Olympic 
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nature-reserve boundaries must submit an 
application that includes public comment, an 
ecological assessment and four other docu- 
ments. Xi and Ji say that they can find no evi- 
dence that this happened. 

Lei Gu, a postdoctoral researcher in evolution 


Extent of proposed 
Olympic Park 


Boundary of core area 
of Songshan National 
Nature Reserve 


Edge of less / 
stringently protected 
buffer zone 


and conservation biology at Peking Univer- 
sity in Beijing, says that Songshan is not that 
important in terms of biodiversity. The bigger 
problem, he says, is that a government that 
has been increasingly issuing statements and 
regulations that emphasize its commitment to 
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environmental sustainability and conservation 
seems to be backing away from its promises. In 
May, for example, China's environment minis- 
try released a notice, signed by ten government 
agencies, that stated that any development at 
odds with a reserve's function is “strictly for- 
bidden”. Gu and Xi both worry that failure 
of the strict policies in Beijing would send a 
broader signal to local governments. If Beijing 
violates Songshan’s reserve, “it will be easier for 
local governments to give construction pro- 
jects higher priority than conservation issues’, 
says Gu, whose Weibo posts on the subject are 
no longer accessible. “The real impact’; he says, 
would be “the breaking of Chinese laws and 
policies on nature reserves”. 

Xi says that a better site for the alpine skiing 
event would be Zhangjiakou, a city in neigh- 
bouring Hebei province, which will also host 
some Winter Olympic events. Zhangjiakou 
has mountains that are already developed and 
would suffer no environmental loss, he says. = 


CLARIFICATION 

The News story ‘Stanene makes its debut’ 
(Nature 524, 18; 2015) should have made 
it clear that the stanene was made by 
experimentalists at Shanghai Jiao Tong 
University. 
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n 2013, Beau Kilmer took on a pretty 

audacious head count. Citizens in the 

state of Washington had just voted to 

legalize marijuana for recreational use, 
and the state’s liquor control board, which 
would regulate the nascent industry, was 
anxious to understand how many people were 
using the drug — and importantly, how much 
they were consuming. 

The task was never going to be straight- 
forward. Users of an illicit substance, parti- 
cularly heavy users, often under-report the 
amounts they take. So Kilmer, co-director 
of the RAND Drug Policy Research Center 
in Santa Monica, California, led a team to 
develop a web-based survey that would ask 
people how often they had used cannabis in 
the past month and year. To help them gauge 
the amounts, the surveys included scaled 
pictures showing different quantities of 
weed. The survey, along with other data the 
team had collected, revealed a rift between 
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perception and reality. Based on prior data, 
state officials had estimated use at about 
85 tonnes per year; Kilmer’s research sug- 
gested that it was actually double that, about 
175 tonnes’. The take-home message, says 
Kilmer, was “we're going to have to start 
collecting more data’. 

Scientists around the world would echo 
that statement. Laws designed to legalize 
cannabis or lessen the penalties associated 
with it are taking effect around the world. 
They are sweeping the sale of the drug out of 
stairwells and shady alleys and into modern 
shopfronts under full view of the authorities. 
In 2013, Uruguay became the first nation to 
legalize marijuana trade. And several coun- 
tries in Europe — Spain and Italy among 
them — have moved away from tough penal- 
ties for use and possession. Thirty-nine US 
states plus Washington DC have at least some 
provisions for medicinal use of the drug. 
Washington, Colorado, Alaska and Oregon 


Retailers in Colorado 
can now sell cannabis 
to the public. 


have gone further, 
legalizing the drug 
for recreational con- 
sumption. A hand- 
ful of other states including California and 
Massachusetts are expected to vote on similar 
recreational-use measures by the end of 2016. 

But the rapid shift has caught researchers 
on the back foot. “Broadly speaking, there’s 
about 100 times as many studies on tobacco 
or alcohol as there are on illegal substances,” 
says Christian Hopfer, a psychiatry researcher 
at the University of Colorado School of Medi- 
cine in Denver. “I don’t think it’s the priority 
it should be” 

Despite claims that range from its being a 
treatment for seizures to a cause of schizo- 
phrenia, the evidence for marijuana’s effects 
on health and behaviour is limited and at 
times conflicting. Researchers struggle to 
answer even the most basic questions about 
cannabis use, its risks, its benefits and the 
effect that legalization will have. 

The quick shifts in policies should provide 
a plethora of natural experiments, but the 
window will not be open for long. “There's 
an opportunity here. Some of the most 
informative research we can do is right at the 
moment the market changes,” says Robert 
MacCoun, a social psychologist and public- 
policy researcher at Stanford Law School in 
California who worked with Kilmer on the 
research done in Washington. 


WHAT ARE THE NEGATIVE EFFECTS? 

For years, the debate over the drug’s safety 
has been polarized. Those seeking legaliza- 
tion claim that it is basically harmless. Yet 
governments around the world have placed 
cannabis among the most-dangerous illegal 
drugs, running hard-hitting campaigns warn- 
ing of the threats that it poses to mental health 
and social well-being. 

Scientists are fairly sure about some things, 
particularly when it comes to the short- 
term effects. They know, for instance, that 
it impairs memory and coordination, and 
can cause paranoia and psychosis’. These are 
some of the classic symptoms of being ‘high 
and can have major health effects in and of 
themselves. Studies have found, for exam- 
ple, that drivers are between twice and seven 
times as likely to crash if they have recently 
smoked the drug”. 

In the longer term, effects are less clear, but 
there are a few that most scientists agree on. 
Contrary to many popular arguments, there 
is evidence that cannabis is addictive. Around 
9% of users become dependent on the drug, 
showing signs of addiction such as develop- 
ing tolerance or experiencing withdrawal 
symptoms when they stop using. Beyond that, 
however, long-term effects have been difficult 
to pin down. 

Cannabis is often smoked, and this can 
raise the risk of respiratory problems and 
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possibly lung cancer. A 2008 study in New 
Zealand found that smoking pot increased 
the risk of lung cancer by 8% for each ‘joint- 
year’ (the equivalent of smoking a joint per 
day for one year), even after taking tobacco 
use into account’. But other studies have 
found little to no correlation with lung cancer, 
even for heavy users*. 

Other health outcomes are even more diffi- 
cult to disentangle from confounding factors. 
Some researchers have found links to poor 
educational performance, low social attain- 
ment — such as job status — and altered brain 
development. For example, the Christchurch 
Health and Development Study — which 
followed almost 1,300 children born in New 
Zealand in 1977 — found that people who 
used cannabis daily are around 50% more 
likely to have psychotic symptoms’ than are 
non-users and are at greater risk of not finish- 
ing school’. 

And another study from New Zealand, 
which followed 1,000 people in Dunedin 
from birth to age 38, shows that persistent 
cannabis use, especially if started young, 
correlates with steeper declines in IQ in later 


“Marijuana 
research is 

like tobacco 
research in the 
60s. Any study 
about harms is 
challenged.” 


life and with problems with memory and 
reasoning compared with people who have 
never used the drug’. 

Hall says that the association with nega- 
tive social and mental-health outcomes has 
been consistently observed, but the debate 
“is how we explain that association’, which 
he says will probably involve a combination 
of factors. 

The difficulty, says Valerie Curran, a 
psychopharmacologist at University College 
London, lies in teasing apart correlation and 
causation, because “there are so many con- 
founders”. For example, adolescents who use 
cannabis are probably also drinking excessive 
amounts of alcohol and engaging in other 
risky activities. Attributing the effects to one 
particular substance or behaviour is therefore 
very difficult. 

Similar problems abound in the hotly 
contested link between cannabis and schizo- 
phrenia. Multiple studies have shown an 
increased risk of this mental-health disorder 


20 AUGUST 2015 | VOL 524 | NATURE | 281 


© 2015 Macmillan Publishers Limited. All rights reserved 


| NEWS FEATURE 


in people who use cannabis versus people 
who do not. A study of 50,000 Swedish men 
aged 18-20 found that heavy users were 
around three times more likely to develop 
schizophrenia than those who had never 
used the drug’. Although the increase in 
risk was significant, the overall risk is still 
low — just 1.4% of men who reported using 
cannabis developed the disorder, compared 
with 0.6% of those who said they had never 
tried the drug. Some cannabis advocates sug- 
gest that the link may be down to people with 
such problems ‘self-medicating’, but this is 
difficult to prove. 

Many of the negative health outcomes 
seem to be exacerbated if the drug is used 
in adolescence, leading to suggestions that 
cannabis is adversely affecting developing 
brains. And effects may also be linked to the 
drug’s potency, which in itself is hard to pin 
down. 

As cannabis use becomes legal, the data 
may become easier to collect. But the drug’s 
use is still low compared with alcohol and 
tobacco, says Wayne Hall, an addiction 
researcher at the University of Queensland 
in Brisbane, Australia, so it is hard to draw 
firm conclusions. Marijuana may be the 
most popular illegal drug, he says — about 
44% of US adults have used it at some point 
in their lives according to one source — but 
only about one in ten have used it in the past 
year. By contrast, around 70% drank alcohol 
in that time. “The number of people who use 
it with any regularity for a long time is pretty 
small. The longer-term consequences are 
really understudied,” says Hall. 


HOW STRONG IS IT? 

A major question for researchers — and 
a complication in interpreting the evi- 
dence — is dosing. There are more than 85 
cannabinoid chemicals in pot. The one of 
most interest to researchers — and users — is 
tetrahydrocannabinol (THC). Growers have 
been able to breed high concentrations of the 
chemical into strains of the plant meant for 
recreational and medicinal use. A potency- 
monitoring programme run by the University 
of Mississippi for the US National Institute on 
Drug Abuse (NIDA) found that THC levels 
have steadily increased in the United States", 
from 2-3% in 1985-95 to 4.9% in 2010. The 
increase is even starker for imported cannabis 
seized by law-enforcement officials. For these 
drugs, potency has gone from less than 4% in 
the late 1980s and early 1990s to more than 
12% in 2013. 

But it is hard to determine the amounts of 
THC being consumed by the average cus- 
tomer. It is unclear, for example, whether 
users ‘titrate’ their doses, adjusting their 
intake according to the potency. Nicotine 
users are known to do this with cigarettes, 
but nicotine does not impair judgement in 
the same way that cannabis does. And the 


Reefer madness 


After more than a century of fairly lenient 
laws surrounding cannabis, public attitudes 
in the United States and elsewhere shifted 
against the drug in the early 1900s. 
Anti-marijuana propaganda escalated in the 
1930s, with films depicting the horrors that 
befell users (see right). 


19/6 


The Netherlands decriminalizes possession, 
use and sale of small quantities of marijuana. 


1996 


California becomes the first state to legalize 
medical cannabis, through Proposition 215. 


2012 


Washington and Colorado vote to legalize 
recreational marijuana for adults aged 21 or 
older, although the drug does not become 
available for sale until 2015. 


NOT RECOMMENDED FOR 
o> 


Survey data have revealed a dramatic shift in attitudes towards marijuana in the United States. 
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effects of THC are less immediate, especially 
for edible forms. 

The escalating potency raises questions 
about previous research because users in 
older studies may have been consuming 
lower-potency cannabis, and the effects 
may be different (see “Research gaps’). A 
study published earlier this year, for exam- 
ple, linked high-potency cannabis to a 
threefold-increased risk of psychosis ver- 
sus non-use but found no association with 
lower-potency forms’”. And many research- 
ers have complained that the pot approved 
for study in experiments funded by NIDA is 
a poor match for what is used recreationally 
or medicinally. 

In tandem with changing laws, the Colorado 
Department of Public Health and Environ- 
ment (CDPHE) is establishing reference labs 
to check the potency of what is sold. And the 
US government is expanding the varieties of 
marijuana that researchers with federal fund- 
ing can obtain. 

In places where the drug is legal, existing 
labelling standards may also be inadequate. A 
survey done between August and October last 
year found that only 17% of edible cannabis 
products in San Francisco, Los Angeles and 
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1990 


1995 2000 2005 2010 2015 
Seattle had accurate labels. More than half 
had less THC than claimed, and some con- 
tained significantly more’’. “A lot of people 


get a rude surprise,’ says MacCoun. 


ARE THERE MEDICAL BENEFITS? 
Although states are starting to ease restric- 
tions on recreational use of marijuana, what 
got the ball rolling in changing public percep- 
tions and the legal landscape for pot were the 
arguments for its medical use. 

Colorado introduced its rules allowing 
medical marijuana more than a decade before 
it allowed recreational use. The amendment 
to the state’s constitution listed eight condi- 
tions for which marijuana was approved: 
cancer, glaucoma, HIV/AIDS, cachexia (a 
progressive wasting syndrome), persistent 
muscle spasms, seizures, severe nausea and 
severe pain. But, says Larry Wolk, execu- 
tive director and chief medical officer of the 
CDPHE, “those are dictated by the constitu- 
tion and not necessarily by medical research”. 

Although there is a huge amount of 
anecdotal evidence — and well-organized 
advocacy groups that campaign for easier 
access to medical marijuana — there is little 
conclusive scientific evidence for many of the 
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claimed medical benefits. One of the reasons 
for this dearth of evidence is that money gen- 
erally has been obtainable only for research 
on the negative effects of cannabis. That is 
beginning to change. 

When Colorado first legalized the drug, 
its public-health department began col- 
lecting fees from patients who applied to 
purchase pot at medical dispensaries. By 2014, 
the state had amassed more than US$9 mil- 
lion, most of which was ploughed back into 
a medical marijuana research programme 
selected by the CDPHE. Among the projects 
funded by the Colorado millions, there are 
two investigating whether cannabinoids 
can help to mitigate seizures in childhood 
epilepsy. Similar research is being pursued 
in the United Kingdom and elsewhere in the 
United States. 

Another, more-established use is for people 
with multiple sclerosis. A cannabis-based 
spray has been approved in 27 countries for 
treatment of muscle problems associated with 
the disease, such as spasms. 

Other claimed benefits of marijuana, such 
as boosting appetite in people with AIDS, 
are supported by more-limited evidence. If 
positive effects can be clearly demonstrated, 
it would be a huge vindication for marijuana 
advocates. It might also go some way towards 
justifying medical-marijuana legislation. 

In the meantime, however, scientists are 
watching the emerging cannabis frontier with 
wary eyes. “I think it’s an experiment,” says 
Robert Booth, a psychiatry researcher at the 
University of Colorado. “When this study is 
all said and done, we'll know a whole lot about 
the effects of marijuana.” 


WHAT HAPPENS WHEN YOU MAKE IT LEGAL? 
One of the biggest questions is how legaliza- 
tion will change usage patterns. One place in 
which researchers are looking for answers 
is Europe, where cannabis regulation tends 
to be much lighter than it is in the United 
States (see “Reefer madness’). In the United 
Kingdom, some police forces overlook can- 
nabis use and small-scale growing operations. 
Spain allows private consumption, but still 
has restrictions on sales. 

The most extreme and long-standing 
example is the Netherlands, which decrimi- 
nalized the possession and sale of small quan- 
tities of cannabis in 1976. But although some 
streets of Amsterdam have been transformed 
into pungent tourism hotspots, the country 
as a whole has not changed its habits much. 

Although hard data on cannabis use in 
Europe is patchy, the Netherlands does not 
have hugely more users than other nations. 
Data aggregated by the United Nations Office 
on Drugs and Crime put use in the Nether- 
lands at about 7%. That is more than in Ger- 
many (5%) and Norway (5%), about the same 
as in the United Kingdom and less than in the 
United States (15%). Nor has the Netherlands 


Research gaps 


Published research on marijuana lags behind that of 
other drugs, such as alcohol and cocaine, according 
to a search of the bibliographic database Scopus. 
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The amount of tetrahydrocannabinol (THC) in 
cannabis has risen sharply in the past three decades, 
raising concerns about the relevance of older research 
and studies using less-potent varieties of the drug. 
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seen a huge spike in use of harder drugs, 
dampening fears that marijuana serves as a 
gateway to more-dangerous substances such 
as heroin and cocaine. The message from the 
Netherlands, says Franz Trautmann, a drugs- 
policy researcher at the Trimbos Institute in 
Utrecht, the Netherlands, is that “a very liberal 
policy doesn’t lead to a skyrocketing preva- 
lence”. Rather, cannabis is endemic, he says. 
“We can’t control this through prohibition. 
This is something which more and more is 
recognized.” 

But the lesson from the Netherlands may 
be limited because the drug is still illegal, 
and growing and selling large quantities is 
still punishable by law. Colorado has gone 
further by legalizing not merely the drug’s 
use, but the whole production chain, and that 
could have fundamentally different effects 
on the economics of pot. “Legalized produc- 
tion really raises the prospect of a dramatic 
drop in price,” says MacCoun. “It’s conceiv- 
able marijuana prices could drop 75-80% in 
a fully legalized model.” (Although Uruguay 
legalized the drug in 2013, it reportedly has 
struggled to regulate production and to set up 
working dispensaries.) 

The effects of a sharp drop in cost are 
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unknown. Taxation may also have unintended 
consequences. If states tax by weight, users 
might look to higher-potency strains to save 
money. And once cannabis is a business, it 
gains a business lobby. Cannabis researchers 
already talk of being bombarded with e-mails 
from pro-cannabis groups if they make nega- 
tive comments about the drug. “Marijuana 
research is like tobacco research in the 60s,” 
says Hopfer. “Any study about harms is chal- 
lenged. It’s really something” Many fear that 
the big money now to be found in cannabis 
will drive attempts to obfuscate the risks. 
“If the commercial interests are too big, then 
the profit interest is prevailing above the 
health interest. This is what I’m afraid of} says 
Trautmann. 

Legalization provides an opportunity to 
answer some important questions. In a few 
years, Colorado, Washington and others 
will know (if only roughly) how legaliza- 
tion affects usage patterns, the number of 
car crashes and the number of people seek- 
ing help for drug dependency. The CDPHE- 
funded programmes will have added to the 
knowledge of beneficial effects. And continu- 
ing long-term studies of large groups of users 
will provide more evidence for statisticians 
who are attempting to disentangle correlation 
and causation on the negative impacts. 

“When a jurisdiction changes its mari- 
juana laws, that provides an opportunity for 
greater leverage on the questions of cause and 
effect,” says MacCoun. But, he adds, the sig- 
nals will only really be clear if the laws result 
in a dramatic increase in use — something 
that is neither a given, nor necessarily desir- 
able. “Obviously, we don’t want marijuana 
use to rise just to allow us to answer our 
questions, but if it does, we'll be poring over 
all the data” m 


Daniel Cressey writes for Nature from London. 
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New Orleans flooded and burning, the week after Hurricane Katrina. 


Hurricane Katrina’s 
lessons for the world 
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Ten years after the devastation of the US Gulf coast, Edward B. Barbier calls for coastal 
protection plans like that adopted by Louisiana for the world’s most vulnerable nations. 


r | Yen years ago, Hurricane Katrina 
devastated the US Gulf coast states. 
It caused around US$110 billion 
in damages, more than 1,800 deaths, and 
displaced 1.2 million people’. 

The disaster led to a rethink of the 
management of the Gulf coastline. In the 
seven decades preceding 2005, Louisiana 
had lost coastal lands, mainly marshes, 
totalling around 4,900 square kilometres 
—amnarea the size of Trinidad and Tobago’. 


Following the hurricane, the President’s 
Gulf Coast Ecosystem Restoration Task 
Force recommended extensive wetland 
re-establishment, noting that they “provide 
a natural flood attenuation function”. 
After the hurricane season in 2005, the 
Louisiana state legislature created the 
Coastal Protection Restoration Author- 
ity (CPRA) and tasked it with coordinat- 
ing the local, state and federal efforts. 
The CPRA embarked on five-year coastal 


master plans to guide policymakers in 
developing a more sustainable coast. 
I served on the science and engineering 
board that oversaw the scientific analysis 
for the 2012 master plan. This plan was 
charged with evaluating, prioritizing and 
integrating coastal protection and restora- 
tion projects over the next 50 years, with 
a total budget of $50 billion. The plan 
accounts for how coastal restoration and 
protection in Louisiana may have to > 


20 AUGUST 2015 | VOL 524 | NATURE | 285 


© 2015 Macmillan Publishers Limited. All rights reserved 


COMMENT 


15 MOST-AT-RISK NATION 


40% of Brazilians in 
rural low-lying coastal 
areas are living in 
poverty. 


> adapt to differing climate-change and 
sea-level-rise scenarios. 

The 2012 plan is not perfect. Oyster, 
shrimp and crab harvesters, landowners, 
waterfowl hunters and shipping interests have 
expressed concerns about river-diversion 
projects, delaying them. And the marsh-cre- 
ation plans are ambitious: around 100 square 
kilometres of new wetlands from 2012 to 2031 
(ref. 2). Combined with the building of levees 
and other structures, however, these efforts 
lower the damage costs associated with the 
risk of flooding along Louisiana's coast by 
$5.3 billion to $18 billion annually’. 

Flaws and merits aside, Louisiana's 50-year 
coastal-planning strategy represents a new 
way of thinking about the long-term man- 
agement of coasts. Resilience is the new 
aim — against short-lived natural disasters 
that have immediate and often extreme 
impacts, such as flooding and storm surges, 
and against long-term climatic changes that 
have more gradual impacts, such as sea-level 
rise, saline intrusion and erosion. 

Other parts of the world urgently need 
such along-term strategy. Around 38% of the 
global population — 2.5 billion people — live 
within 100 kilometres of the coast. More than 
three-quarters of these people are in develop- 
ing countries. The frequency of flooding asso- 
ciated with cyclones and other major coastal 
storms is likely to increase by the 2050s as a 
result of accelerated sea-level rise*. Coastal 
areas are the front lines of climate change. 

The most vulnerable are poor, rural pop- 
ulations in developing countries that live 


Almost 60 million people live in poverty in the low-lying 
rural areas of these countries’ coastlines. 


Half of Indians in 
rural low-lying 

coastal areas are 
living in poverty. 


All of Nigeria’s coastal 
agriculture is vulnerable 
to storm intensification 
and sea-level rise. 


One-fifth of 
Bangladesh’s coastal 


40% of China’s 
coastal wetlands are 
vulnerable to storm 
intensification and 
sea-level rise. 


gross domestic product 


and agriculture are 
vulnerable to storm 
intensification and 
sea-level rise. 


less than 10 metres above sea level, in low- 
elevation coastal zones (LECZs). In 2010, 
around 267 million people lived in the rural 
areas of LECZs. By 2100, the figure is pro- 
jected to be 459 million (ref. 5). 

Almost all of the world’s 60 million poor 
people living in rural LECZs reside in just 
15 countries: India, Bangladesh, Myanmar, 
Cambodia, Nigeria, Pakistan, Iraq, Mozam- 
bique, Senegal, Brazil, China, Indonesia, 
the Philippines, Vietnam and Thailand (see 
‘15 most-at-risk nations’). Many of these 
people depend on agriculture and fishing, 
and rely on natural barriers such as man- 
groves for protection against storms and 
other coastal hazards that are set to intensify 
with climate change. Yet, as happened on the 
Gulf coast, these barriers are being eroded 
(see Supplementary Information; go.nature. 
com/3mxhl}). 

These 15 nations should be the priority 
for a long-term global planning strategy 
to protect coasts and populations that are 
vulnerable to damaging storms. 


THREE LINES 

Consider Bangladesh. The current capacity, 
infrastructure and coping strategies of its 
coastal communities are inadequate to 
deal with current storms, let alone any of 
increased intensity. Over the next 35 years, 
the area of the coastline that is vulnerable 
to flooding of deeper than 3 metres could 
increase by more than 69%; the zones 
vulnerable to floods that are deeper than 
1 metre could grow by 14%°. 
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A one-off investment of an estimated 
$2.4 billion is needed to cope with these 
hazards in Bangladesh from now until 2050, 
with extra maintenance and operating costs 
of $50 million per year. These sums are half 
the projected financial costs of future dam- 
ages, deaths and injuries from the increased 
hazards associated with climate change’. 

A long-term global resilience strategy 
for coasts needs three lines of defence. 
First is the restoration of ‘gree infrastruc- 
ture — robust coastal ecosystems, such as 
salt marshes, oyster and coral reefs, man- 
groves, seagrass beds and beaches. These are 
essential, especially in remote, rural parts of 
developing countries”*. Second is the right 
‘grey infrastructure, such as seawalls, dikes, 
barrages and diversions as well as raising the 
elevation of buildings and flood-proofing 
structures. Third is to involve key local 
stakeholders throughout; this builds insti- 
tutional and coastal community capability. 

This third element — engagement, par- 
ticipation and empowerment — must not 
be overlooked. A good example of engage- 
ment is India’s Joint Mangrove Management 
programme that, with the help of non- 
governmental organizations, enlists local 
communities in conservation and affores- 
tation efforts’. Similarly, Louisiana's Coastal 
Master Plan was developed in consultation 
with state legislators, coastal parish officials, 
community groups, business leaders, advo- 
cacy organizations and Louisiana scientists. 
A series of public meetings attracted more 
than 750 people to comment on the plan; 


DESHAKALYAN CHOWDHURY/AFP/GETTY 


Indian villagers participate in mangrove restoration. 


more than 2,200 comments came in online, 
by e-mail and by post”. 

In the long term, community development 
will also need to include policies and invest- 
ments that drive livelihood diversification 
— moving away from coastal resource-based 
activities such as fishing and agriculture 
towards manufacturing and services. And 
these communities need investments and 
improvements in health, education, local 
governance and institutions. As a first step 
towards determining which long-term 
investments should be a priority in Kenya, 
Madagascar, Mauritius, the Seychelles and 
Tanzania, for example, 
assessments have been 
conducted of coastal 
communities’ vul- 
nerability and adapt- 
ability to the main 
impacts of climate 
change on coral-reef 
fisheries’. 

Finally, the most 
vulnerable popula- 
tions may need to be encouraged to migrate 
to non-coastal areas. Schemes such as vol- 
untary property acquisition or assisted 
relocation may be required, as was recom- 
mended by the Louisiana master plan’. In 
Bangladesh and other developing countries, 
improving roads in remote, rural LECZs 
might be the best option to encourage vol- 
untary migration from areas of high flood 
risk to safer areas with better employment 
opportunities®. 


ADAPTATION FUNDS 

The 2012 Louisiana master plan has 
demonstrated that it is possible to put all 
three lines of coastal defence in place — 
green, grey and social. But it has done so 
in one of the richest nations in the world. 
Developing countries will clearly need inter- 
national support to create and implement 
such plans. For instance, the Global Envi- 
ronment Facility and the United Nations 
Development Programme (UNDP) are 
already collaborating with the government 
of Bangladesh in support of its five-year, 
$8.5-million Community-based Adaptation 
to Climate Change through Coastal A ffores- 
tation programme’. 

Such long-term planning efforts and 
capacity-building should be financed 
through international climate-change 
adaptation funds. This could be done on 
a cost-sharing basis with relevant govern- 
ments and local jurisdictions in coastal 
zones. Currently, around $13.4 billion in 
the UN’s Green Climate Fund as well as the 
main adaptation funds have been pledged to 
assist developing countries adapt to climate 
change. Just one quarter of that could make 
rural LECZs more resilient, and improve the 
livelihoods of nearly 60 million poor people 
currently living in these zones. 

The next step should be for the COP21 
UN climate summit in Paris this December 
to earmark multilateral adaptation funds 
to support developing countries in their 
efforts to devise a long-term global plan- 
ning strategy for enhancing and protecting 


coasts and populations vulnerable to dam- 
aging storms. The World Bank, UNDP and 
the UN Environment Programme should 
also commit to assisting local and national 
authorities through a coordinated effort. m 


Edward B. Barbier is professor of 
economics at the University of Wyoming in 
Laramie, Wyoming, USA. His book Nature 
and Wealth: Overcoming Environmental 
Scarcity and Inequality will be published 
this year. 

e-mail: ebarbier@uwyo.edu 
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Sensory support is helping this man with autism spectrum disorder to connect with his therapist. 


Seeing the 
spectrum entire 


Chris Gunter examines a comprehensive history of the 
science and culture surrounding autism studies. 


ithin two generations, the popular 
and scientific understanding of 
autism spectrum disorder (ASD) 


and related conditions have undergone a 
massive shift in some parts of the world. We 
have moved from routine institutionalization 
(or worse) of people with ASD to an apprecia- 
tion of a spectrum of social communication. 
How did this sea change come about? 
Journalist Steve Silberman has been writing 
and commenting on autism for years, nota- 
bly with a 2001 feature in Wired magazine 
on ASD rates in California's Silicon Valley. 
He has compiled his exhaustive research into 
NeuroTribes to try to answer that question. 
Genetic and neurological studies now 
firmly suggest that ‘autism’ describes a con- 
stellation of behaviours. This is defined in 
the fifth edition of the American Psychiat- 
ric Association’s Diagnostic and Statistical 
Manual of Mental Disorders (DSM-5, 2013) 


as “persistent difficulties in the social use of 
verbal and nonverbal communication” and 
“restricted, repetitive patterns of behav- 
ior, interests, or activities” In 2014, the US 
Centers for Disease Control and Preven- 
tion estimated that 1 in 68 US children had 
ASD. Co-occurring 
challenges abound — 
including anxiety and 
digestive problems — 
but only about half of 
people diagnosed with 
ASD also have intellec- 
tual disability. We now 
recognize that autis- = 
tic people can have a 
hugely varying range 
of impairments and 
abilities, which can 
change throughout life. 

This recognition 


NeuroTribes: The 
Legacy of Autism 
and the Future of 
Neurodiversity 
STEVE SILBERMAN 
Avery: 2015. 
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has been hard won. Silberman takes us back 
to the early twentieth century and clinics in 
Europe, Russia and the United States that first 
began using the terms autismus or autistic 
psychopathy (from the Greek word autos, 
meaning ‘self”) to describe apparent with- 
drawal into an inner world. 

The clinicians most often credited with 
discovery are Hans Asperger and Leo Kanner. 
Asperger's clinic in 1930s Vienna embraced 
the full range of ASD. But Silberman asserts 
that to protect his charges from euthanasia by 
the Nazis, Asperger focused his case reports 
on gifted children ostracized by their peers, 
later termed high-functioning. Eventually, 
these cases would be called Asperger’s syn- 
drome; in the DSM-5, controversially, this 
diagnosis is folded into ASD. 

Kanner wrote a 1943 paper about his 
patients in Baltimore, Maryland, which is 
generally considered the first description of 
autism (Nerv. Child 2, 217-250; 1943). He 
focused on “extreme autistic aloneness” and 
“an anxiously obsessive desire for the main- 
tenance of sameness”. This restricted view 
would shape the field for five decades. 

With child psychologist Bruno Bettelheim, 
Kanner did further damage in the 1940s and 
1950s by popularizing the unfounded idea 
that loveless “refrigerator” parenting caused 
autism. The pair suggested that children with 
autism be institutionalized ‘for their own 
good. Silberman provides ample biographi- 
cal detail on many such personalities in early 
autism research, as well as Bernard Rimland, 
who founded the US National Society of 
Autistic Children (now the Autism Society) 
in 1965 and pioneered the involvement of 
parents in support networks and the search 
for treatments and educational strategies. 

As he must, Silberman discusses the dis- 
astrous impact of the now-retracted 1998 
paper in which UK surgeon Andrew Wake- 
field (since struck off the medical register) 
alleged a link between ASD and the mea- 
sles, mumps and rubella vaccine. Silber- 
man demolishes the study’s claims, long 
dismissed as fraudulent and debunked by 
massive longitudinal studies. 

Instead, rising rates of ASD diagnosis in 
the past four decades (US prevalence was 
estimated at 1 in 5,000 in 1975) can be tied 
to — but not wholly explained by — the use of 
expanded diagnostic criteria from the 1980s. 
UK clinician Lorna Wing and her colleagues, 
whose revisions to the DSM-III were adopted 
in 1987, fully expected to see a rise in diagno- 
ses, hearkening back to Asperger's 1944 defi- 
nition of “autistic psychopathy’; in which he 
described his observations as “not at all rare”. 
Wing adopted the term autism spectrum, 
recalling the words of statesman Winston 
Churchill: “Nature never draws a line with- 
out smudging it” 

The rise in diagnoses is also linked to 
awareness increased by films featuring autistic 


ABK/BSIP/SPL 


JAMES LEYNSE 


characters, such as Rain Man (1988), 
as well as the advent of the Internet and 
social media, which enabled the formation 
of support groups and allowed communi- 
cation in ways that can be easier for peo- 
ple with ASD. As some of the stigma has 
ebbed, parents have become more willing 
to seek diagnoses for their children and 
even themselves. 

One of the most powerful talks at the 
2015 International Meeting for Autism 
Research was by John Elder Robison, 
who has been diagnosed with Asperger's 
syndrome. He told the room of scientists 
and clinicians, 


“The reason diag- “As the stigma 
nosticlabelsareso lias ebbed, 
important to those parents have 
ofuswithautismis become more 
that without those willing to seele 


labels, we only have 
the labels we got 
in the streets, which are hateful”? While 
we debate diagnostic criteria, we must 
remember that there are people receiving 
these labels (and not) who deserve respect, 
understanding and support. 

Fittingly, Neuro Tribes ends with the 
neurodiversity movement that is now 
emerging. Groups such as the Autism 
Self Advocacy Network campaign for 
spending less on finding ‘cures’ and more 
on designing appropriate support and 
accommodation for people with ASD 
and their families and caregivers. 

Any work on ASD will be seen as a 
triumph by some and a travesty by others; 
Silberman has opted not to be shy. Great 
contributions include interviews with 
prominent scientists and self-advocates. 
He intersperses these with his own opin- 
ion, for example portraying the DSM-III 
as a way for psychiatrists to link to “Big 
Pharma’. His affection for detail can get 
in the way, as in a chapter on the devel- 
opment of radio and electronic bulletin 
boards. Readers should look elsewhere 
for a primer on what we do and do not 
understand of the basic biology of ASD; 
that is not Silberman’s intent. 

NeuroTribes is no casual read. The par- 
ent ofa newly diagnosed child looking for 
information may be put off by the weight 
of past horrors on show. But for people in 
the field, or anyone seeking to understand 
the interplay between medical science 
and patient communities, it should be an 
essential resource. My own copy is already 
dog-eared and underlined throughout. m 


diagnoses.” 


Chris Gunter is director of 
communications for the Marcus Autism 
Center at Children’s Healthcare of 
Atlanta in Georgia. She is also at Emory 
University School of Medicine in Atlanta. 
e-mail: drchrisgunter@gmail.com 
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Bioethics on stage 


Brendan Maher reviews a play inspired by a famous 
clash between geneticists and a Native American tribe. 


State University (ASU) in Tempe collected 
blood samples from the Havasupai people, 
an isolated Native American tribe living in the 
Grand Canyon. A decade later, tribe members 
sued the university for misusing the samples, 
on the grounds that they had not been fully 
informed of the study’s scope. The legal bat- 
tle culminated in a US$700,000 payout from 
ASU; the remaining samples were returned. 
That much is true. But much else about the 
story is disputed. It has surfaced in articles 
and books as an example of the cultural sen- 
sitivities that come up in research. Some argue 
that it has become a fable of arrogant scien- 
tists riding roughshod over people's rights 
—atale that has hardened mistrust between 
researchers and Native American groups. 
The latest retelling is the drama Informed 
Consent, playing at the Duke on 42nd Street 
in New York City. Seasoned playwright 
Deborah Laufer sees her work as an explora- 
tion of truth and the clash between science 
and religion. Although the scientist at its cen- 
tre comes across as a caricature of hubris, the 
piece effectively presents some ethical, legal 
and social complexities of modern genomics. 
It is highly fictionalized. The real 
researchers were anthropologist John Martin 
and geneticist Therese Markow, who in the 
1990s were trying to find genetic underpin- 
nings for the Havasupai’s high incidence 
of type 2 diabetes — roughly 55% among 
women and 38% among men. In the play, 
Ken (played by Jesse Perez) is a social anthro- 
pologist who has worked with an unnamed 
tribe in the Grand Canyon for 40 years, and 


E the early 1990s, researchers at Arizona 


DeLanna Studi (left) and Tina Benko in 
Informed Consent. 


Informed Consent Jillian (Tina Benko) is 
DEBORAH ZOELAUFER an ambitious genetic 
= a nen anthropologist strug- 
eet, New York City. : : 7 
Until 13 September. gling with the knowl 


edge that she carries a 
genetic mutation that ensures that she will 
develop early-onset Alzheimer’s disease. 

Jillian jumps at Ken’s offer to run a study 
looking for diabetes markers in tribe mem- 
bers’ DNA. Soon, she reveals other inten- 
tions, such as studying how the tribe came 
to America — even though this would 
conflict with their own story about where 
they came from. Ken emphasizes that this 
is “strictly” a diabetes study. The fourth wall 
fading momentarily, Jillian tells the audi- 
ence that he never said “strictly”. Other cast 
members jump in to support her version, 
and Ken revises his line — a playful swipe at 
the indefinite nature of remembered truth. 

Jillian convinces tribe members to hand 
over samples of their blood — which they 
deem sacred — by suggesting that DNA 
tests are their only hope of halting the dia- 
betes epidemic. She also deludes herself into 
thinking that the work will lead to funding 
to pursue a cure for Alzheimer’s. Her ambi- 
tion is driven by fear, for herself and for her 
daughter Natalie (DeLanna Studi), who has 
a 50% chance of inheriting the mutant gene. 
Ina parallel plot line, Jillian and her husband 
fight over whether to test Natalie. 

The play crescendos as a tribal spokes- 
person (also played by Studi) confronts Jillian 
at a talk on the tribe’s early migrations out of 
East Asia, saying that she had no right to use 
the samples to study this. Under threat of a 
lawsuit, Jillian’s university fires her and the 
remaining samples are ceremonially returned. 

In the mid-2000s, when the real case came 
to wide attention, progress on techniques for 
studying DNA was outpacing understanding 
of how the research might affect participants. 
There was no clear evidence that Markow or 
Martin had broken any rules, but the case 
and others prompted reconsideration of 
informed-consent documents, which are 
meant to lay out the risks for participants. 

Laufer acknowledges the limits in present- 
ing this story with certainty, especially in an 
engaging stage play, which Informed Consent 
most definitely is. She notes: “I guess what 
happened is much less important than what 
we can learn from the outcomes.” = 


Brendan Maher is a features editor at Nature. 
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MENTAL-HEALTH CARE 


Italy’s psychiatric renaissance 


Andrea Tone admires the history of a radical shift in the treatment of mental illness. 


p to the 1960s, people in Italy’s 
| asylums were stripped of their pos- 
sessions and personhood: victimized 
by rigid rules, overcrowding and filth, tied to 
beds or caged and separated from the outside 
world by towering walls. By the late 1970s, the 
landscape had been transformed bya reform 
movementled by ‘critical psychiatrist’ Franco 
Basaglia. Historian John Foot explores this 
extraordinary revolution and its legacy in 
The Man who Closed the Asylums — first pub- 
lished in Italian as La “Repubblica dei Matti” 
(The ‘Republic of the Mad’; Feltrinelli, 2014). 

The movement, Foot shows, came to con- 
demn asylums as morally bankrupt and of 
limited therapeutic benefit. Spearheaded by 
radical psychiatrists and joined by journal- 
ists, intellectuals, patients, activists and poli- 
ticians, it laid the foundation for a reform 
act, Law 180. Passed in 1978 by health com- 
missioners rather than Parliament, the law 
authorized the de-institutionalization of 
patients, the establishment of community- 
based psychiatric services and the closure of 
asylums. That process finished only in the 
late 1990s, but it returned 100,000 patients 
to society. For the first time, a broad coali- 
tion united in rejection of asylums to parlay 
its views into a national political mandate. 

The revolution began in Gorizia. An 
isolated city on the border of what was 
then Communist Yugoslavia, Gorizia was 
politically centre-right: mostly Christian 
Democrats, with some neo-fascist groups. It 
was an unlikely site to spark a revolution. 

Its asylum, like others in Italy, was “dark 
and sinister; a place of “horror”, notes Foot. In 
1961, Basaglia became its director, primed for 
change. After spending six months in prison 
for anti-fascist politics under Benito Musso- 
lini’s Fascist government, Basaglia knew how 
institutional power could thwart individual 
liberties. His critique of asylum psychiatry 
was influenced by philosopher Michel Fou- 
cault and sociologist Erving Goffman, as well 
as psychiatrist Maxwell Jones, whose work 
turning UK asylums into open therapeu- 
tic communities gave Basaglia, and his wife 
Franca Ongaro, a new paradigm. 

In Gorizia, Basaglia assembled a team — 
the “é€quipe” — of physicians and reformers 
including Giovanni Jervis and Ongaro, the 
latter of whom Foot credits as the principal 
author of Basaglia’s myriad publications. By 
the mid-1960s, the équipe had opened up the 
asylum. Freed from shackles, patients rose 
and slept when they pleased. They socialized 
at the institution’s new bar (equipped with a 


Franco Basaglia in Colorno, Italy, in 1971. 


jukebox), saw films, played sport and gave 
voice to their experiences in a patient-run 
magazine, II Picchio (The Woodpecker). Many 
took day trips and mixed with locals. Patients 
and staff met daily to discuss their care and to 
debate broader questions. Were reforms the 
goal, ora prelude to closing the asylum? How 
should tensions between asylum leaders and 
local authorities — not all of whom embraced 
having patients in their midst — be resolved? 

Despite ideological divisions within and 
beyond the asylum, 


“Small groups Gorizia became 
of people m famous. Activists 
peripheral demanding free- 
places can dom and societal 


change history.” change visited to 
witness “new forms 
of democracy in action”. The Negated Institu- 
tion (Einaudi, 1968), a best-selling book by 
the collective with Basaglia as editor, detailed 
the transformation and propelled critical 
psychiatry into the national spotlight. 

By then, Basaglia and others had come 
to believe that patients would be better off 
in communities with decentralized psy- 
chiatric services. He left Gorizia along with 
some members of his team, and the “Gorizia 
diaspora’, along with the popularity of The 
Negated Institution, had a ripple effect. Psy- 
chiatric care was restructured in other Ital- 
ian cities, including Reggio Emilia, Parma 
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Basaglia and the 
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JOHN FOOT 
* Qy Verso: 2015. 
and especially Trieste. Basaglia’s work there 
culminated in its asylum’s closure in 1980 — 
the year that Basaglia died of a brain tumour. 
Trieste became a centre of pilgrimage for criti- 
cal psychiatrists and a model for reform rec- 
ognized by the World Health Organization. 

The Man Who Closed the Asylums is 
ambitious and well researched. It is also 
detailed and long, packed with block quotes 
anda large cast of characters, as well as a smat- 
tering of case studies of asylums and local 
politics, making it an occasionally choppy 
read. It is less a biography of Basaglia than a 
contextualization of the man and the move- 
ment in a network ofhistorical actors and the 
politics and ideology of post-war Italy. 

Foot says little about how patients were 
diagnosed, the spectrum of their disorders 
and medical interventions, particularly after 
psychiatric drugs such as the anti-psychotic 
chlorpromazine reached Italy in the 1950s. 
Scholars such as Joel Braslow and Andrew 
Scull have generated a vast literature on these 
questions, which Foot sidesteps. Also missing 
is a discussion of what non-asylum psychiat- 
ric care, if any, existed in Italy before Law 180. 

Foot’s argument would have benefited 
from more comparative analysis. Before 
Gorizia, many North American asylums had 
introduced reforms. In 1956, for instance, the 
use of anti-psychotics encouraged US public 
asylums to discharge more people than they 
admitted. Yet, as Foot avers, no country or 
movement went as far as Italy in recalibrat- 
ing the organization and politics of mental- 
health care. Law 180 (and broader health 
reforms that soon subsumed it) was unevenly 
enforced, failed to create a singular model of 
care and was undercut by political opposi- 
tion. Yet Foot’s impassioned story reminds 
us that the future is neither immutable nor 
ordained, and that small groups of people in 
peripheral places can change history. = 


Andrea Tone is professor of history and 
Canada Research Chair in the Social History 
of Medicine at McGill University in Montreal. 
She is the author of The Age of Anxiety. 
e-mail: andrea.tone@mcgill.ca 
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Halt reef destruction 
in South China Sea 


China’s land reclamation in 

the South China Sea has raised 
geopolitical concerns (see 
go.nature.com/tmrpih). It is also 
destroying coral reefs on a large 
scale — together with their fragile 
ecosystems. 

This loss of reef habitat is 
estimated from high-resolution 
satellite imagery to be about 
1,400 hectares so far in the 
Spratly Islands (http://amti.csis. 
org/island-tracker). The reefs 
are among the most biologically 
diverse on the planet. They are 
home to threatened turtle and 
bird species, a crucial stopover for 
migratory birds, and a source of 
larvae for the South China Sea’s 
overexploited coastal fisheries. 
Further dredging will kill nearby 
sediment-dwelling communities 
and sea-grass nursery habitats 
that take up carbon. 

The global scientific, 
conservation and legal 
communities must unite to halt 
this destruction. Reefloss can be 
publicly monitored with Google 
Earth imagery and tools, enabling 
international governing bodies to 
enforce China’s obligations under 
international law to protect and 
sustainably manage their reefs 
(see go.nature.com/wdzf9k). And 
funding should be fast-tracked 
for the proposed protective 
Spratly Islands Marine Peace Park 
(J. W. McManus et al. Ocean Dev. 
Int. Law 41, 270-280; 2010). 
Elizabeth M. P. Madin 
Macquarie University, Sydney, 
Australia. 
dr.elizabeth.madin@gmail.com 


STEM teaching: 
medicine in Europe 


The innovations to science, 
technology, engineering and 
mathematics (STEM) education 
you recommend (Nature 523, 
272-274 and 282-284; 2015) 
have already had a major impact 
on the teaching of medicine in 
Europe over the past 15 years, 
especially in the United 


Kingdom and the Netherlands. 
These have been inspired by 
initiatives such as Best Evidence 
Medical Education (BEME; 
www.bemecollaboration.org) 
and ASPIRE (www.aspire-to- 
excellence.org), both run by 

the Association for Medical 
Education in Europe. 

There have been 32 systematic 
reviews examining the literature 
to determine which teaching 
methods work and why, and 
how they could be made more 
effective — including three 
highly cited BEME reviews 
(S.A. Azer Acad. Med. 90, 
1147-1161; 2015). The ASPIRE 
initiative has ensured that 
teaching is now recognized 
internationally alongside 
excellence in research (see 
R.M. Harden and D. Wilkinson 
Med. Teach. 33, 95-96; 2011). 

There has also been a 
drastic change in performance 
assessment of medical 
students. This no longer 
relies on regurgitating facts. 
Examples include the objective 
structured clinical examination, 
workplace-based assessments 
and entrustable professional 
activities, which emphasize 
the importance of feedback in 
monitoring and supporting 
undergraduate learning. 
Trudie E. Roberts University of 
Leeds, UK; and Association for 
Medical Education in Europe, 
Dundee, UK. 

Ronald M. Harden Association 
for Medical Education in Europe, 
Dundee, UK. 
t.e.roberts@leeds.ac.uk 


STEM teaching: the 
need for wider skills 


Now more than ever, students of 
science, technology, engineering 
and mathematics (STEM) 

need to develop skills for real- 
world problem solving (see 
Nature 523, 272-274; 2015). 
These include leadership, 
collaboration, communication 
and interdisciplinary thinking — 
all sought after by companies and 
organizations that must compete 


in today’s global economy. 

In the Integrated 
Concentration in Science (iCons) 
Program I have launched at the 
University of Massachusetts 
Amherst (see www.cns.umass. 
edu/icons-program), STEM 
students explore the part that 
science can play in tackling 
societal challenges such as 
cancer, climate change and 
clean energy. For example, one 
iCons team identified electrical 
energy storage as a crucial 
societal need, which motivated 
them to design and perform an 
original investigation on the 
efficient electrolysis of water. 
Such innovative educational 
experiences generate new 
categories of learning that 
broaden the meaning of student 
success — important both for 
students and for the future. 
Scott Auerbach University of 
Massachusetts Amherst, USA. 
auerbach@chem.umass.edu 


Funders must not 
coerce scientists 


The reported complicity of 
the American Psychological 
Association (APA) in the 
‘enhanced interrogation 
programme of the US 
Department of Defense (DOD) 
represents improper exploitation 
by the DOD ofits relationship 
with the APA in pursuit of 
its own objectives, distorting 
scientific ethics and practices in 
the process (see Nature 523, 255; 
2015). The scientific community 
as a whole could learn from the 
mechanisms that the DOD used 
to influence the APA, as reported 
in an independent review 
(go.nature.com/4vpdob). 
Historically, similarly 
inappropriate relationships 
have sometimes developed 
between granting agencies and 
scientists (see, for example, 
S. Reverby Examining Tuskegee 
(Univ. North Carolina Press, 
2009) and Nature 467, 645; 
2010). Scientists’ alignment with 
institutional preferences is often 
a force for good, but vigilance 


is needed to prevent erosion of 
ethical or evidential standards. 

Institutional coercion such 
as that revealed in the report 
is disastrous for the public 
perception and integrity of science. 
The moral force of research 
standards is weakened if they are 
subject to political concerns. 

We all need to address 
this potential for abuse 
without impeding the normal 
functioning of scientists and 
their granting institutions. For 
example, negotiations on ethics 
and research practices between 
institutions could be overseen to 
ensure good-faith participation 
and eliminate any back-door 
dealing. Such regulatory 
authority could be absorbed 
by existing bodies such as the 
US Federal Inspectors General. 
Cole Helsell University of 
California, San Francisco, USA. 
cole.helsell@ucsf.edu 


India’s inspiring 
former president 


A.P. J. Abdul Kalam (1931-2015) 
was the first great scientist to 
become president of India (see 
Nature India go.nature.com/ 
x8rltb). He died on 27 July while 
delivering a lecture on “The 
liveable planet’ to students at the 
Indian Institute of Management 
in Shillong. 

Dubbed the ‘missile man’ 
because of his pioneering 
years as head of India’s missile 
programme, he was a big 
supporter of fundamental and 
applied science in the country. 
He was a favourite among 
students, igniting the minds 
of primary-school children 
and PhD students alike about 
science through his passion and 
approachability, patience and 
simplicity. 
Bhaswati Chatterjee National 
Institute of Pharmaceutical 
Education and Research, 
Hyderabad, India. 
Suman S. Thakur Centre for 
Cellular & Molecular Biology, 
Hyderabad, India. 
sst@ccmb.res.in 
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Tagged tags engage disposal 


Damaged organelles called mitochondria were once thought to be disposed of by a simple signalling cascade. Cell-based 
analyses now reveal that a network of complicated molecular interactions initiates disposal. SEE ARTICLE P.309 


NORIYUKI MATSUDA & KEIJI TANAKA 


itochondria are the energy- 
Mees factories of the cell, so it 

is essential for cellular health that 
they function properly. When mitochondria 
are damaged, for example by stress or ageing, 
the electrical potential across their membranes 
decreases. These depolarized mitochondria 
are then marked with the molecule ubiquitin, 
which tags them for disposal. Disruption of this 
process might lead to the accumulation of dam- 
aged mitochondria, and has been implicated in 
Parkinson's disease’. The molecular connection 
between ubiquitin and mitochondrial disposal 
has been controversial, but in a paper on page 
309, Lazarou et al.” demonstrate that this inter- 
action is mediated by phosphorylation. 

Unwanted or damaged components of 
the cell are digested or recycled en masse by 
an intracellular degradation process known 
as autophagy. Increasing evidence indicates 
that selective forms of autophagy (such as 
mitophagy, which targets only mitochondria) 
enable intracellular quality control by specifi- 
cally disposing of one type of damaged com- 
ponent. During mitophagy, the kinase enzyme 
PINK] and the ubiquitin ligase enzyme Parkin 
cooperate to attach ubiquitin tags to proteins 
on the outer membrane of depolarized mito- 
chondria’ — a process called ubiquitylation. 

In 2010, it was proposed’ thata linear signal- 
ling cascade leads to the clearance of damaged 
mitochondria. In this model, PINK1 accumu- 
lates on mitochondria in response to a decrease 
in membrane potential, and recruits Parkin, 
which catalyses the ubiquitylation of outer 
mitochondrial membrane (OMM) proteins. 
Autophagy receptor proteins such as p62 then 
assist in degradation by binding to both the 
ubiquitylated substrate and to components of 
the autophagy machinery, such as the protein 
LC3. LC3 coats structures called phagophores, 
which engulf mitochondria in preparation for 
autophagic degradation. 

However, this simple model has since 
been questioned, for several reasons. First, 
although the disappearance of OMM pro- 
teins was initially used to monitor mitophagy, 
later papers** reported that OMM proteins 
ubiquitylated by Parkin are subject to degra- 
dation through a different pathway. Second, 


= Ubiquitin 
—___ OMM protein 


il ,tiw 


Damaged mitochondria 


Phagophore 


Figure 1 | The molecular mechanism of mitophagy. a, During mitophagy, the kinase enzyme PINK1 
accumulates on damaged mitochondria, phosphorylating (P) the ubiquitin molecules that tag outer 
mitochondrial membrane (OMM) proteins. PINK1 also phosphorylates the enzyme Parkin, and PINK1 
and ubiquitin both activate and recruit Parkin to mitochondria. Parkin adds more ubiquitin to damaged 
mitochondria, thereby amplifying the mitophagy signal. b, Lazarou et al.” add to this already complex 
model by demonstrating that phosphorylated ubiquitin recruits the proteins OPTN and NDP52. These 
autophagy receptors bind to phosphorylated ubiquitin and to proteins such as LC3 that coat structures 
called phagophores — in the case of OPTN, this depends on it being phosphorylated by the enzyme 
TBK1. Phagophores then engulf the mitochondria in preparation for disposal. 


the drug normally used to induce depolariza- 
tion-dependent mitophagy in experimental 
conditions, CCCP, was found to cause other 
cellular changes that also increase the amount 
of autophagy’. Third, the contribution of p62 
to PINK1-Parkin-mediated mitophagy has 
proved controversial”*. 

Over the past two years, several papers” 
have revealed that PINK1 phosphorylates 
ubiquitin and that phosphorylated ubiquitin 
(phospho-ubiquitin) is both a receptor for 
and an activator of Parkin (Fig. 1a). These 
results contradict the simple hierarchy of 
the old model, and instead suggest that the 
components of the pathway can react inter- 
dependently. Ubiquitin might act upstream 
of Parkin as well as downstream, for example, 
because it can activate Parkin when phos- 
phorylated by PINK1. And PINK] might act 
downstream of Parkin as well as upstream 
because Parkin-catalysed ubiquitin tags can 
be substrates for PINK1 phosphorylation. 

To further investigate the interactions 
between components of the mitophagy cascade, 
Lazarou et al. studied mitophagy in cells that 
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lacked genes encoding five autophagy receptors 
(p62, NBR1, NDP52, TAX1BP1 and OPTN), 
either individually or in combination. To ensure 
that mitophagy was monitored accurately, 
the authors used degradation of mitochon- 
drial DNA, rather than degradation of OMM 
proteins, as an indicator of mitophagy. As a 
second indicator, they analysed spectral shifts in 
mitochondrial fluorescent protein (shifts occur 
when mitochondria are engulfed for degrada- 
tion). To depolarize mitochondria, the research- 
ers used the drugs oligomycin and antimycin 
A, which target mitochondria more specifically 
than do pan-depolarizing agents such as CCCP. 

Lazarou and colleagues found that p62 and 
NBR1 are not essential for PINK1-Parkin- 
mediated mitophagy, but that NDP52 and 
OPTN (and to some extent, TAXBP1) are 
required for mitophagy, a finding that is partly 
consistent with previous studies”*’*. The 
authors also showed that the ubiquitin-binding 
domains of NDP52 and OPTN are essential for 
the proteins’ ability to translocate to depolar- 
ized mitochondria and for mitophagy to occur. 
Furthermore, deletion of TBK1, a kinase that 


phosphorylates OPTN, revealed the importance 
of this phosphorylation for effective mitophagy. 

Under the old model, Parkin-mediated 
ubiquitylation of OMM proteins is required to 
recruit autophagy receptors to mitochondria. 
Intriguingly, the authors found that attaching 
PINK1 to mitochondria recruited OPTN and 
NDP35z2, even in the absence of Parkin or mito- 
chondrial damage. But the receptors could 
not translocate to mitochondria if ubiquitin 
was not phosphorylated, or if the receptors 
were mutated to lack their ubiquitin-binding 
domains. Finally, Lazarou and colleagues 
showed that OPTN and NDP32 bind prefer- 
entially to phospho-ubiquitin rather than to 
ubiquitin. 

Armed with these data, the complex 
interactions at play can be untangled. After 
damage to the mitochondria, PINK1 phos- 
phorylates ubiquitin that is already attached 
to OMM proteins. The resulting phospho- 
ubiquitin is crucial for recruiting OPTN and 
NDP52 to mitochondria. Parkin assists in the 
process, amplifying the phospho-ubiquitin by 
setting up a positive feed-forward cycle, so that 
more ubiquitin attaches to the OMM proteins. 
This enables OPTN and NDP352 to engage com- 
ponents of the autophagy machinery to initiate 
mitophagy (Fig. 1b). 

This updated model is consistent with the 
authors’ surprising observation that OPTN 
and NDP52 can be recruited to mitochon- 
dria independently of Parkin. However, the 
model does not imply that Parkin has no part 
to play, because Lazarou and colleagues found 
that Parkin expression dramatically increased 


OPTN-dependent mitophagy. Strong genetic 
interactions between the PINK1 and PARKIN 
genes in model systems have been reported”. 
Moreover, familial Parkinson’s disease can 
be caused by mutations in either PINK1 
or PARKIN in humans’. So Parkin clearly 
cooperates with PINK1 and has an important 
role in PINK1-mediated mitophagy under 
physiological conditions. 

The findings of the current study involved 
only in vitro and cell-culture experiments, so 
their physiological significance in vivo remains 
to be demonstrated. Further biochemical and 
structural studies should explore how the phos- 
phorylation of ubiquitin mediates its interac- 
tions with OPTN and NDP352. Such types of 
investigation are already improving our under- 
standing of how Parkin is regulated by PINK1- 
phosphorylated ubiquitin — an interaction 
that was also discovered through in vitro and 
cell-culture experiments”. For instance, a 
structural study'* published on Nature’s website 
reveals the molecular basis for the way phos- 
pho-ubiquitin regulates Parkin. And a recent 
study” of post-mortem brain samples and cells 
called fibroblasts from people with Parkinson's 
disease points to the physiological significance 
of phosphorylated ubiquitin in humans. Once 
Lazarou and colleagues’ work has been simi- 
larly validated, it could establish a new way of 
thinking about how simple ubiquitin tags play 
multiple specific parts, and will surely provide 
inspiration for further exploration of ubiquitin 
and autophagy. m 


Noriyuki Matsuda is at the Ubiquitin Project, 


Bumps in the road to 
therapeutic antibodies 


A human antibody against dengue virus serotype 2 has been shown to protect 
mice against disease. Structures of the antibody bound to the virus illuminate 
how it binds different viral forms to prevent virus entry into cells. 


LESLIE GOO & THEODORE C. PIERSON 


engue viruses are mosquito-borne 
D flaviviruses that circulate as four 

related serotypes. Around 400 million 
human dengue virus infections occur across 
the globe annually. Roughly 25% of these result 
in clinical symptoms ranging from a self-limit- 
ing fever to a potentially fatal vascular-leakage 
disease’. Despite this disease burden and dec- 
ades of research, no effective vaccine or thera- 
peutic agent is available. To redress this, much 
research has focused on understanding how 
antibodies bind to dengue virus particles and 


inhibit infection, because the presence of neu- 
tralizing antibodies is thought to correlate with 
protection following vaccination, and such 
antibodies hold promise as antiviral drugs”. 
Writing in Science, Fibriansah et al.’ show that 
the monoclonal antibody 2D22 binds mul- 
tiple envelope proteins on the surface of the 
virus particle in a manner that prevents the 
conformational rearrangements necessary for 
virus entry into cells. 

The viral envelope (E) protein is an elongated 
three-domain glycoprotein that orchestrates 
multiple steps in the virus-entry process, includ- 
ing attachment to cellular receptors and fusion 
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of viral and cellular membranes following 
internalization. E proteins are the principal 
targets of neutralizing antibodies, which may 
inhibit infection through several mechanisms. 
How antibodies engage E proteins on virus 
particles (virions), and in what numbers, can 
be influenced by the proteins’ arrangement. 

Pioneering cryo-electron microscopy 
analysis* of dengue virions revealed a dense 
herringbone arrangement of E proteins com- 
prising 30 rafts of three antiparallel dimers 
oriented flat against the viral membrane. 
Subsequent studies”® revealed that not all viri- 
ons have this ‘smooth’ appearance and that, 
at 37°C, the virions of some dengue strains 
take on an expanded structure with a ‘bumpy’ 
surface, on which the E-protein dimers are 
more loosely arranged (Fig. 1a). Whether 
antibodies distinguish between these smooth 
and bumpy structures, or influence tran- 
sitions between these states, has not been 
extensively studied. 

The antibody 2D22 was isolated from an 
individual with immunity to dengue virus 
serotype 2 (DENV2). This antibody neutral- 
izes DENV2 (but not other serotypes) in vitro, 
and has been hypothesized to bind a molecular 
structure (epitope) spanning more than one 
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a Smooth DENV2 at 4 °C 


Bumpy DENV2 at 37 °C 


Figure 1 | Surface structure affects antibody binding. a, The virus particles of some strains of dengue 
virus serotype 2 (DENV2) adopt different surface conformations at different temperatures. At 4°C, 

the surface-envelope (E) proteins are densely arranged as a ‘smooth configuration of 30 rafts of three 
antiparallel dimers (one raft of three dimers is coloured; the red, yellow and blue indicate the three 
domains of each monomer). At 37 °C, the surface becomes ‘bumpy; with the E-protein dimers more 
loosely arranged. b, A close-up of a single E-protein dimer shows the molecular regions (epitopes) 
identified by Fibriansah et al.’ as being bound by the monoclonal antibody 2D22. When the virus surface 
is smooth, 2D22 binds to epitopes that span E proteins within a dimer (amino-acid residues represented 
as cyan spheres), and some that span adjacent dimers (green spheres). The latter contacts are predicted 
to be disrupted on bumpy virions. Molecular graphics and analyses were performed with the UCSF 
Chimera package. PDB IDs for smooth and bumpy virions are 4CCT and 3ZKO. 


E protein’. Fibriansah et al. now demonstrate 
that 2D22 can protect against DENV2 infection 
in immunodeficient mice when administered 
before or after virus infection. 

There is epidemiological evidence for 
increased risk of severe disease following 
secondary infection of an individual with 
a dengue serotype distinct from that of the 
primary infection®; this phenomenon has 
been linked to the activities of poorly neu- 
tralizing antibodies that bind more than one 
serotype, which initiate a mechanism called 
antibody-dependent enhancement of infec- 
tion’. Fibriansah et al. show that variants of 
2D22 engineered to be incapable of binding 
the Fc receptors present on many immune- 
system cells prevented the development of 
lethal antibody-mediated enhanced disease 
in immunodeficient mice. 

The authors then used cryo-electron 
microscopy to solve the structures of 2D22 
in complex with DENV2, for multiple DENV 
strains and at different temperatures. The 
structures show that the antibody binds 
an epitope that bridges E proteins within a 
dimer (Fig. 1b). This is similar to the recently 
reported mode of binding of a group of 


potently neutralizing monoclonal antibodies 
that bind multiple dengue serotypes’”™'. 
However, although this E-dimer epitope 
(EDE) is located within E-protein dimers 
that are the ‘building blocks’ of the virions, 
it was not uniformly displayed on different 
DENV2? strains. The authors’ analysis of a 
primary DENV?2 viral strain revealed smooth 
virus particles at both 4°C and 37 °C that were 
bound by 180 copies of 2D22. By contrast, 
the stoichiometry of antibody binding to the 
laboratory-adapted strain NGC was tem- 
perature dependent — when antibodies were 
bound to smooth virions at 4°C and then the 
temperature was shifted to 37 °C, at which the 
virions adopt the bumpy configuration, one 
third of 2D22 epitopes became unavailable for 
binding. 

This finding implies that the epitopes 
recognized by antibodies, including those 
within the E dimer, may not be equivalently 
accessible on bumpy and smooth particles, 
highlighting a need to understand the struc- 
tural interactions that determine whether a 
given dengue strain will adopt an expanded 
or smooth structure at 37°C. Notably, the 
120 2D22 epitopes present on the NGC strain 
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at 37°C is much greater than the number of 
antibodies predicted to be required for anti- 
body-mediated neutralization of flaviviruses”. 

Multiple lines of evidence suggest that the 
structure of dengue viruses can be variable at 
physiological temperatures. Not all structures 
observed by microscopy are represented by 
the smooth and bumpy configurations, owing 
in part to inefficient maturation or confor- 
mational flexibility of the virion’. How does 
this complexity affect antibody recognition 
and immunity? Researchers have character- 
ized several potently neutralizing human 
antibodies that bind complex epitopes made 
up of more than one E protein’*”’, including 
others that bind to EDE’™””. This form of rec- 
ognition should in principle be sensitive to 
changes in the surface of the virion that alter 
the herringbone arrangement of E proteins, as 
illustrated by Fibriansah and colleagues’ data. 
Amino-acid variation that alters higher-order 
E-protein structures or the strength of dimer 
contacts thus has the potential to affect the 
neutralization potency of these antibodies. 

Conversely, antibodies may have an active 
role in shaping the surface of infectious virions. 
Some EDE-binding antibodies can ‘push’ 
immature virus structures into the E-dimer 
configuration found on mature virions, 
allowing these antibodies to neutralize 
the complex spectrum of virus geometries 
released from infected cells’*"’. The identifica- 
tion of potently neutralizing human antibodies 
such as 2D22 that bind complex epitopes is 
an important step toward identifying targets 
for vaccine and drug design. Such antibod- 
ies also have the potential to provide useful 
insight into the many shapes and forms of 
infectious virions. m 
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ORGANIC CHEMISTRY 


Light opens pathways 
for nickel catalysis 
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+2 is bound to both a carbon and an oxygen 
atom. These complexes are easily accessible 
from nickel(o) but do not readily undergo car- 
bon-oxygen (C-O) bond-forming reactions. 
Instead, a nickel(111) intermediate, generated 
through single-electron-transfer oxidation, is 
required for efficient reductive elimination. 
This reactivity has been used in non-catalytic 
C-O bond-formation reactions** in which 


The report of a light-activated catalyst that generates usually inaccessible nickel 
intermediates provides a general strategy that might allow elusive catalytic 


reactions to be realized. SEE LETTER P.330 


JOHN J. MURPHY & PAOLO MELCHIORRE 


the catalyst’s oxidation state mainly dic- 

tates which reactions are possible. The 
ability to modulate the inherent oxidation 
state of a metal intermediate by applying an 
external stimulus can therefore enable other- 
wise inaccessible catalytic modes of reactivity, 
opening up unprecedented 
opportunities for innovation in 
metal-mediated catalysis. On a 
page 330, Terrett et al.' describe 
how this ideal can be achieved 
using a catalyst activated by vis- 
ible light. Their strategy allowed 
them to develop a general method 
of coupling carbon atoms to oxy- 
gen atoms by using a nickel cata- 
lyst, even though nickel is usually 
resistant to this type of chemical 
transformation. 

The field of nickel catalysis has 
made tremendous advances in 
the past decade’, with valuable 
chemical transformations being 
developed to synthesize natural g 
products, polymers and pharm- 
aceuticals. Nickel is the lightest 
element in group 10 of the peri- 
odic table, a group it shares with 
palladium and platinum, and 
it has similar reactivity to these 
elements. It can mediate reac- 
tions to construct carbon-carbon 
and carbon-heteroatom bonds 
(heteroatoms are any atoms other 
than carbon or hydrogen), com- 
plementing established protocols 
catalysed by more-expensive 
noble metals. 

Nickel is particularly attractive 
for catalysis because it has several 
oxidation states that are primed 
for distinct modes of reactivity. 
Nickel-catalysed reactions gener- 
ally involve redox events in which 
one or two electrons are trans- 
ferred, and which interconvert 
organometallic nickel inter- 
mediates, changing the metal’s 
oxidation state. In particular, 


: or catalysis mediated by transition metals, 
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high-valent nickel complexes (in which nickel 
is in oxidation state +3 or +4) have a greater 
tendency to promote carbon-heteroatom cou- 
pling reactions** than do their lower-valent 
counterparts (in which nickel is in oxidation 
states 0, +1 or +2). In chemistry jargon, they 
are more susceptible to reductive elimination. 

For example, organonickel(11) alkoxides are 
complexes in which nickel in oxidation state 
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Figure 1 | Switching oxidation states for nickel catalysis. a, Terrett et al.’ 
reporta method for constructing alkyl aryl ethers from simple alcohols and 
aryl bromides at room temperature, in which the redox activity of a light- 
activated iridium (11) catalyst is integrated with a nickel catalytic cycle. 

R represents different attached chemical groups; Ar represents aromatic 
groups (benzene rings and related structures); Br, bromine. b, In the proposed 
mechanism, the aryl bromide adds to a nickel(o) species, producing a 
nickel(11) intermediate, ArNi(11)Br. c, This reacts with the alcohol to generate 
an inert organo-nickel(11) alkoxide, ArNi(11)OR. d, e, The authors propose 
that when the iridium(111) catalyst is irradiated by light (d), the resulting 
excited complex (*Ir'”) oxidizes ArNi(11I)OR through single-electron transfer 
(SET; e), generating an Ir(11) species and a nickel(111) intermediate, 
ArNi(111)ORBr. f, This intermediate undergoes reductive elimination to yield 
an alkyl aryl ether and a nickel(1) species, Ni(1)Br. g, SET from the iridium(1) 
species to Ni(1)Br restores the original oxidation states of the two metal 
catalysts, completing both catalytic cycles. 
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preformed organo-nickel(11) alkoxides are 
oxidized. But developing a catalytic nickel- 
mediated system to forge C-O bonds* poses 
more of a conundrum. The main problem 
is the need for an oxidant that not only pro- 
motes the formation of the crucial high-valent 
intermediates (organo-nickel(111) alkoxides) 
from catalytic nickel(11) precursors, but is 
also compatible with the reactivity of metal 
species in lower oxidation states — mainly 
nickel(o) species — that are also present in the 
catalytic cycle. 

Terrett and colleagues have solved this prob- 
lem by integrating the redox activity of an 
iridium(11) catalyst that absorbs 
visible light (an iridium(111) pho- 
tocatalyst) with a nickel catalytic 
cycle. They used this strategy to 
prepare a wide array of syntheti- 
cally valuable compounds called 
alkyl aryl ethers at room tem- 
perature from readily available 
starting materials (alcohols and 
aryl bromides; Fig. 1). Crucially, 
on absorption of weak light, the 
iridium(111) photocatalyst enters 
an excited state (*Ir'”) in which 
iridium can readily engage in an 
array of redox processes. Spe- 
cifically, *Irl takes an electron 
from the otherwise inert organo- 
nickel (11) alkoxide to yield an elec- 
tron-rich iridium(1m) centre and 
the requisite nickel(111) alkoxide, 
which then undergoes the desired 
C-O coupling to form the alkyl 
aryl ether. 

After the product has been 
released, the nickel catalyst, now 
in oxidation state +1, must be 
reactivated so that it can undergo 
further catalytic cycles. The elec- 
tron-rich iridium(11) species gives 
it an electron to form a nickel(o) 
species, which can easily add to an 
aromatic bromide (an event 
known as oxidative addition). This 
final electron-transfer process 
simultaneously returns the two 
metal catalysts to their original 
oxidation states. 

Terrett and co-workers’ pro- 
posed mechanism suggests that 
multiple nickel species, in dif- 
ferent oxidation states, are tran- 
siently generated. Therefore, the 
problem that typically complicates 
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the development of nickel-catalysed pro- 
cesses under oxidizing conditions — achiev- 
ing oxidation of the nickel(11) intermediate 
without oxidizing the nickel(o) species — 
might also be an issue here. But because the 
iridium photocatalyst is partitioned within 
different oxidation states by light activation, 
it can donate and receive electrons at differ- 
ent stages of the reaction, ensuring a kind of 
redox equilibrium. So although nickel(o) oxi- 
dation could occur through the action of the 
iridium photocatalyst, this would probably 
bea reversible process; only when the crucial 
organo-nickel(11) alkoxide is generated will 
productive C-O coupling occur. 

Over the past few years, the redox properties 
of iridium photocatalysts have been used to 
develop a variety of light-driven transforma- 
tions, establishing a field known as photore- 
dox catalysis’. Combinations of photoredox 
and nickel catalysis are also known*”, in which 
the photocatalytic process generates carbon- 
centred radicals, which are then captured by 
nickel(1r) intermediates. But Terrett and co- 
workers’ study is particularly exciting because 
it demonstrates that the light-activated iridium 
catalyst can participate directly in the nickel 
catalytic mechanism by leveraging the oxida- 
tion state of the organometallic intermediates. 

Moreover, the authors provide a great 
practical advance in the catalytic formation of 
alkyl aryl ethers. The creation of such ethers 
from alcohols and aryl halides is one of the 
most widely used chemical transformations 
in modern synthetic chemistry. But Terrett 
and colleagues’ method requires much milder 
reaction conditions than do existing catalytic 
technologies, including those based on palla- 
dium” or copper’! — that is, the reactions can 
be conducted at ambient temperature by using 
blue-light-emitting diodes as the energy source, 
rather than at higher temperatures, and avoid 
the use of strong bases that can compromise the 
stability of the reagents. What is more, water 
can be used asa substrate instead of alcohols, 
allowing easy access to products called phenols. 
The authors demonstrate a wide range of sub- 
strates, but there are still many opportunities to 
expand the scope of the reaction. 

The discovery that a photoredox catalyst, 
activated by weak light, can unlock previously 
inaccessible nickel coupling mechanisms 
will greatly interest synthetic organic chem- 
ists, and create opportunities to realize other 
elusive catalytic reactions. In a similar vein, a 
ruthenium photocatalyst has recently been 
reported” to enable a difficult carbon-nitro- 
gen bond-forming reaction by driving the 
oxidative formation of a key nickel(11) inter- 
mediate. Further insights into the intricate 
networks of redox processes in these related 
reactions are needed to translate the methodo- 
logical blueprint to other metal species and 
organometallic reactivities. = 
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Hypoxia response 
becomes crystal clear 


The crystal structures of two proteins that respond to reduced tissue oxygen 
levels — hypoxia-inducible factors — provide insight into their function and 
reveal sites for rational drug design. SEE ARTICLE P.303 


RONEN MARMORSTEIN & M. CELESTE SIMON 


xygen is a key substrate for cellular 

energy production and numerous 

intracellular biochemical reactions. 
Maintaining oxygen homeostasis is there- 
fore essential for the survival of most species. 
Oxygen deprivation (hypoxia) triggers com- 
plex responses at the cellular, tissue and 
organismal levels, to match oxygen supply 
with metabolic and bioenergetic demands. 
Several evolutionarily conserved molecular 
responses to hypoxia are mediated by 
hypoxia-inducible factors’ (HIFs) — trans- 
cription factors that regulate metabolism, 
stem-cell behaviour, the generation of blood 
vessels and red blood cells, inflammatory 
responses and cell motility. The first crys- 
tal structures of two HIFs are described by 
Wu et al.’ on page 303 of this issue. 

The involvement of HIFs in so many 
cellular pathways means that the proteins 
play a central part in normal development 
and physiology, and are dysregulated in many 
diseases, including cancer. Yet despite the fas- 
cinating biology that they mediate, and their 
importance as drug targets, the molecular basis 
for HIF activity has remained incompletely 
characterized, impeding the development of 
highly specific inhibitors. A primary reason 
for this is that HIF proteins, which are hetero- 
dimeric complexes consisting of subunits of 
HIF-a and ARNT (aryl hydrocarbon receptor 
nuclear translocator), have not previously been 
crystallized for high-resolution X-ray structure 
determination (other than heterodimers of the 
PAS-B domains of HIF-2a and ARNT; ref. 3). 
Wu et al. have now achieved this, and present 
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structures of HIF-la-ARNT and HIF-2a- 
ARNT complexes — alone, bound to their cor- 
responding hypoxia-response DNA elements 
(HREs), and bound to the previously identi- 
fied small-molecule inhibitors 0X3 (ref. 4) and 
proflavine’. 

The structures show that the HIF-2a and 
ARNT subunits wrap around each other in a 
non-symmetrical way, such that their respec- 
tive basic helix-loop-helix (bHLH), PAS-A 
and PAS-B domains make different contacts 
with neighbouring domains (Fig. la). 
Although the domains of HIF-2a interact 
contiguously with each other to generate an 
elongated substructure, the ARNT domains 
do not interact with each other, but instead 
bind HIF-2a domains in the heterodimeric 
complex. This form of dimerization is highly 
unusual. Moreover, the domain-—domain 
interactions are distinct from those seen in 
another heterodimeric transcription fac- 
tor that has bHLH and PAS domains, the 
CLOCK-BMALI complex’. These observa- 
tions suggest that ARNT is highly selective 
for associations with HIF-la and HIF-2a, and 
that the protein subunits also help to stabilize 
the complex. Wu et al. confirmed this func- 
tion by mutating ARNT and HIF-a interface 
amino-acid residues and showing that several 
of these mutations inhibit association of the 
subunits in vivo. 

The bHLH-PAS domain structure is char- 
acteristic of several related family-member 
transcription factors, such as AhR (ref. 7) and 
NPAS2 (ref. 8), which have been shown to use 
their respective PAS motifs to bind small mol- 
ecules. But whether the same is true for HIF 
complexes was not known. Wu and colleagues’ 


structure of HIF-2a-ARNT reveals five poten- 
tial binding ‘pockets’ for small-molecule com- 
pounds, one in each of the PAS domains of 
HIF-2a and ARNT, with the fifth lying at the 
interface between HIF-2a PAS-B and ARNT 
PAS-A (Fig. la). The authors also present a 
structure of HIF-2a-ARNT bound to the small 
molecule proflavine indicating that it binds to 
the interface pocket. This is consistent with the 
previous observation that acriflavine (a mix- 
ture of proflavine and trypaflavin) destabilizes 
HIF-2a-ARNT heterodimers in cells*. A fur- 
ther structure reveals that the HIF-2a inhibitor 
0X3 (ref. 4) binds to the pocket on the HIF-2a 
PAS-B domain. 

The authors also compared the amino- 
acid sequences of the binding pockets of 
HIF-1a and HIF-2a, and found the interface 
(proflavine-binding) pocket to be highly 
conserved, but the PAS-A and PAS-B 
pockets less so. This suggests that small- 
molecule inhibitors may be developed 
that are selective for complexes containing 
either HIF-1a or HIF-2a, or that target both 
complexes. These are both highly desir- 
able goals for drug design, depending on the 
disease context’. The findings also reinvigorate 
the possibility that HIF proteins bind their 
cellular ligands through one or more of 
these subdomains, as has been shown for 
AhR-ligand binding in the gastrointestinal 
system’®. 

Wu and colleagues went on to determine the 
structure of both HIF-la-ARNT and HIF-2a- 
ARNT bound to an HRE sequence — the first 
report of a multidomain bHLH-PAS hetero- 
dimer in association with DNA. As expected, 
both HIF-a-ARNT dimers in complex with 
DNA show the same overall architecture, and 


Proflavine Re 
site ing | ia — 0X3 site 


bHLH 
bHLH 


they use conserved amino-acid residues in 
a DNA-recognition helix region (a1) of the 
bHLH domain to make base-specific contacts 
in the major groove of the DNA. However, 
unexpectedly, a comparison of the DNA-free 
and DNA-bound complexes revealed that the 
amino-terminal end of the al helix under- 
goes a coil-to-helix transition that extends the 
helix to facilitate the base-specific interactions 
(Fig. 1b). The preceding bHLH a2 helix also 
rotates to help orient the al helix for DNA 
recognition. Another surprise from the struc- 
ture is that the PAS-A domain of each HIF-a 
subunit makes contact with the minor groove 
of the DNA approximately six base pairs away 
from the HRE core element, thus extending 
the ‘footprint’ of the HIF-a-ARNT hetero- 
dimer on DNA. These observations reveal that 
HIF-a-ARNT complexes undergo structural 
alteration of the bHLH region, with participa- 
tion of the HIF-a PAS-A domain, for DNA 
recognition. 

Mutations in both HIF-la and HIF-2a 
have been identified from cancer biopsies'””, 
and Wu et al. show that these mutations cor- 
respond to amino-acid residues at interfaces 
between protein domains, DNA-contact resi- 
dues or pocket regions in the PAS domains. 
This finding reinforces the biological impor- 
tance of crystallographic information on 
the HIF-a-ARNT complexes, their mode of 
DNA recognition and the binding of small 
molecules. 

Finally, Wu and colleagues’ findings suggest 
that different bHLH-PAS family members 
integrate distinct environmental cues, and 
that there is not one mode of heterodimeri- 
zation for the bHLH-PAS proteins. Such dif- 
ferently arranged domain-domain junctions 


PAS-A_ PAS-B 


Figure 1 | The HIF-a-ARNT heterodimer off and on DNA. a, The two protein subunits of the 
transcription factor hypoxia-inducible factor 2 (HIF-2) — HIF-2a (purple) and ARNT (green) — each 
have bHLH, PAS-A and PAS-B domains. Wu and colleagues’ crystal structure’ of this heterodimer reveals 
contiguous interactions between the HIF-2a domains, but not between those of ARNT. Furthermore, 

the structures show binding pockets for putative small-molecule ligands (circles), including proflavine 
and 0X3. b, The authors also present a structure of the heterodimer in complex with DNA. The domain 
organization seen in this structure highlights the extension of the bHLH al helices such that they 

can interact with the DNA major groove; reorient the a2 helices to position the a1 helices for DNA 
recognition; and direct minor-groove DNA interactions with the PAS-A domain of HIF-2a. 
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50 Years Ago 


A Star Called the Sun by Dr. George 
Gamow — Weare reminded of 
Dante on his journey to the inferior 
regions: “I would have cast me 

into molten glass to cool me, when 
Ienterd, so intense raged the 
conflagrant mass” ... the author 
compares the Sun with other stars 
and explores the nature of such old 
ones as red giants, white dwarfs 

and supernovae, the stars that “die 
glamorously” ... The style is popular 
and sometimes racy. Occasionally it 
will offend the English reader who 
has some feeling for his Mother 
tongue, but on the whole, in spite of 
his “oodles and oodles’, the author 
has avoided ... ambiguities and 
half-statements. W.L. Sumner 

From Nature 21 August 1965 


100 Years Ago 


The Book of France edited by 
Winifred Stephens — The book 
opens with an appreciation of 
France by Mr. Henry James, 
written with the copiousness 

and verve which characterize 

all his work...Perhaps the most 
arresting and striking contribution 
to the work is the “Debout pour 

la Derniére Guerre!”...done into 
nervous, palpitating English by 
Mr. H. G. Wells ... “La Basilique- 
Fantéme’, by Pierre Loti...is a 
remarkable word-picture of the 
present state of the cathedral of 
Rheims... “The most irreparable 
disaster is that of those great 
stained windows composed by the 
mysterious artists of the thirteenth 
century ... Masterpieces that no 
one can reproduce showered down 
their fragments never to be sorted 
again, their wonderful golds and 
reds and blues, of which the secret 
has been lost, upon the pavement 
stones. Gone for ever those rainbow 
transparencies, gone for ever those 
companies of saints with the charm 
of their simple attitudes.” T.E. Thorpe 
From Nature 19 August 1915 
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not only promote stability, but also allow the 
proteins to incorporate diverse signals and 
communicate these signals across domains. 
These signals may include cellular ligands, 
disease-related mutations and post-transla- 
tional modifications. The authors’ solving of 
the HIF-a-ARNT structures provides exciting 
opportunities to explore previously unchar- 
acterized cell-regulatory pathways and to 
advance drug discovery. = 
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Multiple origins of 
mountain life 


A study of DNA sequences from more than 1,800 organisms on Mount Kinabalu in 
Borneo reveals the evolutionary mechanisms that led to the mountain’s high and 


unique biodiversity. SEE LETTER P.347 


ALEXANDRE ANTONELLI 


ountains occupy only about one- 
Meee: of the world’s land surface 

outside Antarctica, yet they are 
home to around one-third of all terrestrial 
species'. However, little is known about when, 
where and how this mountain biodiversity 
developed. In a paper on page 347, Merckx et 
al.’ investigate the origins of species on a highly 
diverse tropical mountain, Mount Kinabalu 
on the island of Borneo. They find that most 
endemic species — those found nowhere else 
— arose relatively recently on the mountain, 
deriving from both local ancestors and distant 


Mountain range 


immigrants that were pre-adapted to cool 
environments. These observations have direct 
implications for our understanding of mon- 
tane biodiversity, and may offer clues on how 
to protect both the unique organisms found 
there and the habitats that provide the stage 
for speciation. 

Ever since the first botanical documentation 
of a tropical mountain by Alexander von 
Humboldt more than two centuries ago’, 
naturalists have been fascinated by the diverse 
and unusual variety of life forms found 
on mountains. But we still lack answers to 
fundamental questions about the evolution of 
mountain biodiversity. Did the species living 


Long-distance dispersal 


region 


Figure 1 | Routes to mountain biodiversity. Isolated tropical mountains 
generally contain high levels of species richness and endemicity (species 
uniqueness). Merckx et al.’ show that a large proportion of endemic species 
(indicated by asterisks) on Mount Kinabalu in Borneo derive from lineages 
that were previously present on the island (arrow widths reflect the relative 
number of identified cases; organisms depicted are indicative only). Only 
some of these locally recruited species have adapted to different vegetation 
zones. By contrast, some species, especially those found at high altitudes, have 


Dispersal from 
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Local recruitment 
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This article was published online on 5 August 2015. 


on mountains originate in the surrounding 
lowlands, where their ancestors became suc- 
cessively adapted to higher altitudes? Or are 
montane species mostly pre-adapted immi- 
grants from far-away mountains? And are the 
mountain-dwelling species ancient or recent? 

Merckx and colleagues got to the root of 
these questions by using a new comparative 
approach. Rather than studying the formation 
of species in a single group of organisms and 
extrapolating from there, which has been the 
standard approach so far, the researchers col- 
lected a large variety of organisms — from frogs 
and snails to insects, plants and fungi — that 
inhabit the iconic Mount Kinabalu and its sur- 
roundings. They then sequenced DNA from 
the approximately 1,850 collected specimens, 
compared these sequences with others in their 
own collections and public databases, and cal- 
culated their relationships, ages, geographical 
origins and ancestral environments. 

The first striking result is that most of the 
montane organisms examined are relatively 
young. They started to speciate during the past 
6 million years, after, or at the same time as, the 
rise of the mountain they inhabit. Unlike the 
ancient creatures found on a remote tropical 
mountain in Arthur Conan Doyle’s novel The 
Lost World, this finding suggests a recent origin 
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their origins in similar vegetation zones on other mountain ranges outside 
Borneo, or at lower altitudes in temperate regions, and have arrived by means 
of long-distance dispersal. (Note that vegetation zones on mountain ranges 
are usually at higher elevations than on isolated mountains, owing to heat 
retention and wind shadowing.) Most of these immigrant lineages have then 
undergone local speciation. The evolutionary history of Mount Kinabalu’s 
biodiversity exemplifies the interactions among dispersal, adaptation and 
speciation in generating mountain biodiversity. 


for montane species across the domains of life, 
and supports the recent speciation documented 
for alpine plants on several continents’. 

The second major finding is the dual origin 
of montane organisms (Fig. 1). Some of the 
species, in particular those found at the highest 
elevations of Mount Kinabalu, have their clos- 
est relatives outside of Borneo. Their ancestors 
were often good at dispersing, such as plants 
or fungi that produced large quantities of 
light seeds or spores that could be transported 
with the wind. Other species — about twice as 
many — derive from local ancestors at lower 
altitudes on the same island. The location of 
Mount Kinabalu, surrounded by an exceed- 
ingly diverse tropical forest at the crossroads 
of Asia and Oceania, two regions that have 
their own distinct fauna and flora, apparently 
provided the mountain with a rich initial stock 
for the evolution of its unique biodiversity. 

Finally, the authors’ analysis shows an over- 
arching role for niche conservatism — the 
tendency for organisms to maintain their 
environmental preferences over evolution- 
ary time. This result is evident both from the 
immigrant and the local lineages that gave rise 
to Mount Kinabalu’s biodiversity. Most of the 
ancestral species were already adapted to cool 
conditions, either in temperate regions or in 
other montane habitats. Even the lineages that 
‘climbed up’ Mount Kinabalu often remained 
in the same broadly defined vegetation zone. 
The niche conservatism and pre-adaptation 
shown for the inhabitants of this tropical 
mountain are in line with previous findings 
across the Southern Hemisphere’ and with 
observed patterns of plant movement into cold 
environments around the world®, 

Merckx and colleagues’ study thus provides 
a textbook example of how biodiversity origi- 
nates from the interplay between long-distance 
dispersal and local recruitment, followed by 
adaptation and speciation through interac- 
tion with changes in the landscape, climate and 
environment’. Its limitations are shared with 
other studies that are based on living organ- 
isms and current species distributions. Biologi- 
cal surveys sample only a fraction of the total 
biodiversity of an ecosystem, and large organ- 
isms found in easily accessible sites are typi- 
cally over-represented. Furthermore, estimates 
of speciation events, geographical history and 
niche conservatism obtained from phyloge- 
netic trees are prone to large error intervals and 
many assumptions, and largely disregard the 
confounding effects of extinction. The integra- 
tion of recent and past data — from DNA, fos- 
sils and environmental and geological proxies 
— could remedy these shortcomings’, but such 
data are still scarce or not readily available. 

Much could be gained by applying Merckx 
and colleagues’ whole-community approach 
to other systems around the world. As well as 
increasing our fundamental knowledge of the 
diversity and distribution patterns of species, 
such eco-evolutionary studies would shed light 


on why some regions are much more biodiverse 
than others. A better understanding of the past 
may also, at least to some extent, help scientists 
to manage the present and predict the future. 
Taking climate changes as an example, evolu- 
tionary studies may help to inform us about 
the resilience of species and ecosystems’. They 
may also help to ascertain the role of moun- 
tains as potential reservoirs of biodiversity, 
because montane species need to move only 
short distances to keep their preferred niche”. 
Finally, we may need to devote more resources 
to preserving the natural corridors that link 
vegetation zones along altitudinal slopes if we 
are to safeguard the biotic interchange reported 
by Merckx et al., and thus enhance our protec- 
tion of the world’s unique and rich mountain 
biodiversity. m 
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Mystery survivor ofa 
supermassive black hole 


The G2 cloud in our Galaxy’s core has survived an encounter with the central 
black hole and failed to trigger a major flare-up in the black hole’s activity. 
A promising theory endeavours to explain the cloud’s nature. 


JOHN BALLY 


he centre of the Milky Way hosts a 

supermassive black hole (SMBH) 

weighing approximately 3.6 million 
solar masses. In 2012 a faint, dusty object of 
cloudy appearance was discovered’, accelerat- 
ing towards the SMBH on an eccentric orbit 
with a predicted closest approach (periapse) 
of only about 200 times the mean Earth-Sun 
separation (Fig. 1). During its encounter with 
the SMBH, the object, known as G2, avoided 
complete tidal disruption by the black hole’s 
gravity, implying that it is gravitationally 
bound (held together by gravity). But what is 
the nature of G2? Various early searches failed 
to find an ordinary or an evolved giant star 
lurking in the cloud. Writing in the Astrophysi- 
cal Journal, Mapelli and Ripamonti* propose 
that the central object might be a planetary 
embryo that was dynamically ejected from 
its parent system and is nearly as large as the 
distance between Earth and the Sun. 

The central few light years of the Milky Way 
encompass a dense stellar field containing 
more than 10 million low-mass stars (Fig. 2); a 
comparable volume near the Sun typically only 
contains one star. Among these dim, ancient 
stars, there is a swarm of luminous massive 


Feb 2014 


Sep 2014 


Figure 1 | At close quarters to Sagittarius A*. 
The G2 cloud is shown at different times along an 
orbit that took it perilously close to Sagittarius A*, 
the supermassive black hole in our Galaxy’s 
centre (red cross). These observations show that 
G2 was on a course towards the black hole from 
2006 to early 2014 (reddish colours indicate that 
the object is receding from the observer). Its 
closest approach to Sagittarius A* was in 

May 2014 and observations in September 2014 
(blue indicates that G2 is approaching) show that 
the object survived being ripped apart by the 
black hole. 
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Figure 2 | The Galactic Centre. The numerical density of stars in the central few light years of our 
Galaxy is at least 10’ times that in the Sun’s neighbourhood. This high density can be glimpsed from this 
high-resolution, near-infrared view of the Galactic Centre, which shows young massive stars and old red 
giant stars. The location of our Galaxy’s supermassive black hole, Sagittarius A*, is indicated. Mapelli and 
Ripamonti’ propose that the object G2 may be a recently formed protoplanet that was previously ejected 
from its parent stellar system and was set on an orbit taking it close to Sagittarius A*. 


stars formed in the past 10 million years’, 
surrounded by a ring of molecular gas contain- 
ing the equivalent mass of tens of thousands of 
Suns. The young hot stars emit intense ultra- 
violet light that ionizes a region of hot plasma, 
which is heated by shock waves driven into it 
by powerful stellar winds. Right in the middle 
of this region lurks the SMBH known as 
Sagittarius A* (ref. 4). 

The discovery of the G2 cloud’s headlong 
plunge towards the central black hole caused 
much excitement among astronomers’. Mod- 
els of G2’s orbit predicted that the cloud would 
pass close to the black hole sometime during 
the spring of 2014. Even by late 2013, there 
was tentative evidence that parts ofa tidal gas 
stream escaping from G2 had passed behind 
the SMBH and were emerging on the other 
side. Would the black hole swallow enough gas 
to ignite a burst of emissions activity? Would 
our Galaxy’s centre suddenly brighten across 
the spectrum from X-rays to radio wavelengths 
and briefly resemble the powerful active galac- 
tic nuclei (quasars) associated with SMBHs in 
some galaxies? Dozens of papers emerged with 
predictions. Numerical models® of clouds on 
plunging orbits predicted that a considerable 
amount of gas would spiral into the clutches of 
Sagittarius A*, igniting fireworks. 

A year after the plunge, we can conclusively 
state that there was no major mass-accretion 
event onto the SMBH. Periapse of G2 occurred 
in May 2014 and there was no subsequent 
flare-up of Sagittarius A*, which continued 
its dim flickering in X-ray and near-infra- 
red wavelengths, and remained completely 


oblivious to the object zipping by at thousands 
of kilometres per second. 

What created the gas and dust seen as G2? 
An obvious candidate was the swollen atmos- 
phere ofa red giant or supergiant star. But such 
stars are luminous, and no such star could be 
detected associated with G2. It was proposed’* 
that G2 was a shock front caused by the inter- 
action between a wind from a low-mass star 
embedded in the cloud and the dense plasma 
found in the vicinity of the Galactic Centre. 
It might also have been a protoplanetary disk; 
or a star ripped apart by tidal forces following 
previous close encounters with the black hole; 
or even a merger of two stars. 

Armed with hindsight and more data, a few 
conclusions can now be drawn. The fact that 
G2 survived the plunge towards the SMBH and 
emerged intact”'” means that it must be asso- 
ciated with a compact, gravitationally bound 
object. Its dim infrared emission implies a 
luminosity less than 30 times that of the Sun. 
And because it is best detected in the wave- 
length region between 2 and 5 micrometres, 
its emission is dominated by that of dust at a 
temperature of 400 to 600 kelvin. A plausible 
explanation has been that the G2 cloud con- 
tains a young, low-mass star surrounded by 
a dusty disk that is losing mass, or that it has 
a wind similar to those found escaping from 
young low- to intermediate-mass stars near 
the Sun?" 

Young stars that exhibit excess infrared 
emission, which is an indicator of extreme 
youth, have been found in the Galactic Cen- 
tre before’”. Circumstellar disks irradiated 
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by ultraviolet radiation, such as those found 
in the nearby Orion Nebula”, or those that 
are probably associated with young stars in 
the Galactic Centre, lose mass in a process 
called photo-ablation. Protoplanets, planetary 
embryos or planets in dense star clusters can 
be gravitationally stripped away from their 
parent stars following encounters with other 
stars to become free-floating, sub-stellar-mass 
objects”. 

The occasional encounters of young, low- 
mass objects with the Milky Way’s SMBH may 
cause them® to brighten dramatically as tidal 
forces pull them apart. Therefore, events such 
as the G2 encounter might provide a new way 
of detecting such otherwise invisible, low- 
mass objects. Mapelli and Ripamonti model 
the effects of tidal disruption and photo- 
ablation on G2 and demonstrate that young 
planetary embryos with a mass of about 10 
to 100 times that of Jupiter, or protoplanets 
with diameters comparable to the Earth-Sun 
separation, may become detectable as objects 
resembling G2. This would occur as their 
outer layers are stripped away by the SMBH, 
heat up, and become ionized by the ambient 
radiation field. 

If further analyses show that G2 does not 
contain a star, the authors’ proposal that G2 
is a planetary embryo bound together by its 
relatively weak self-gravity could be a plau- 
sible explanation of this mysterious object. 
Continued monitoring of the Galactic Cen- 
tre by facilities such as the Atacama Large 
Millimeter/submillimeter Array in Chile and 
the Jansky Very Large Array in New Mexico, 
could complement infrared observations 
and tell us about the formation’ of low-mass 
stars, and even planetary systems, in extreme 
environments such as our Galaxy’s centre. m 
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Structural integration in 
hypoxia-inducible factors 


Dalei Wu', Nalini Potluri', Jingping Lu', Youngchang Kim? & Fraydoon Rastinejad' 


The hypoxia-inducible factors (HIFs) coordinate cellular adaptations to low oxygen stress by regulating transcriptional 
programs in erythropoiesis, angiogenesis and metabolism. These programs promote the growth and progression of many 
tumours, making HIFs attractive anticancer targets. Transcriptionally active HIFs consist of HIF-o, and ARNT (also called 
HIF-1f) subunits. Here we describe crystal structures for each of mouse HIF-2a-ARNT and HIF-1a-ARNT heterodimers 
in states that include bound small molecules and their hypoxia response element. A highly integrated quaternary 
architecture is shared by HIF-2a-ARNT and HIF-la—ARNT, wherein ARNT spirals around the outside of each HIF-a 
subunit. Five distinct pockets are observed that permit small-molecule binding, including PAS domain encapsulated 
sites and an interfacial cavity formed through subunit heterodimerization. The DNA-reading head rotates, extends and 
cooperates with a distal PAS domain to bind hypoxia response elements. HIF-a mutations linked to human cancers map 
to sensitive sites that establish DNA binding and the stability of PAS domains and pockets. 


Three HIF-«% proteins (HIF-1%, HIF-2« and HIF-3a) mediate the 
adaptive transcriptional response to hypoxia in both normal and 
tumour cells'*. To function as productive transcription factors, 
HIF-« proteins heterodimerize with the aryl hydrocarbon receptor 
nuclear translocator (ARNT (HIF-1f))*°. The HIF-o and ARNT 
proteins belong to the mammalian basic helix—loop—helix-PER- 
ARNT-SIM (bHLH-PAS) family®’, which includes the aryl hydrocar- 
bon receptor (AHR), neuronal PAS proteins (NPAS1, NPAS2, 
NPAS3, NPAS4), single-minded proteins (SIM1 and SIM2), and the 
circadian clock components BMAL1 and CLOCK (see Extended Data 
Fig. la). By shaping transcriptional responses to environmental and 
cellular signals, these factors drive a variety of physiological and dis- 
ease processes in humans'*. 

The HIF-« proteins function as the oxygen-sensitive regulatory 
subunits of HIF-a-ARNT heterodimers’. HIF-1% and HIF-2« are 
critical for the oxygen stress response’, while the role of HIF-30 
remains less understood’®. Under normoxia, two proline residues in 
the oxygen-dependent degradation domain of HIF-1o and HIF-2« 
are hydroxylated by prolyl hydroxylase domain (PHD) enzymes'’. 
PHD activity requires both oxygen availability and the TCA cycle 
intermediate o-ketoglutarate. The proline modifications target the 
HIF-o proteins to the ubiquitin—proteasome (26S) pathway for degra- 
dation’’. An asparagine residue in their carboxy-terminal segment is 
also hydroxylated’’, blocking HIF-1o interactions with the coactiva- 
tor p300 (ref. 14). 

Under hypoxia, HIF-1a and HIF-2« subunits escape degradation 
and heterodimerize with ARNT to regulate gene programs’"’*. These 
programs increase the oxygen supply and nutrient delivery to cells via 
angiogenesis and erythropoiesis. The HIFs additionally upregulate 
the expression of glucose transporters and glycolytic enzymes, to 
supply the metabolic needs of growing cells in low oxygen conditions. 
These pathways are particularly important for the growth of many 
solid tumours. HIF activity is also critical for the progression of 
kidney-disease-related anaemia, cardiac ischaemia, fatty liver and 
diabetes'*"*. The modulation of HIF pathway by small molecules 
is recognized as a promising approach in cancer therapeutics’’”®. 


The PHDs that regulate HIF-« proteins are actively pursued as 
anticancer targets”; however, the multi-domain structures of HIF- 
a-ARNT complexes have not been visualized, hindering understand- 
ing of their drug-binding capabilities. 

As members of the bHLH-PAS family®, the HIF-a and ARNT 
proteins have an amino-terminal bHLH segment for DNA binding, 
tandemly positioned PAS domains (PAS-A and PAS-B) for dimeriza- 
tion, and transactivation domains (TADs)’ (Fig. 1a). For our studies, 
we used their contiguous bHLH-PAS-A-PAS-B segments, which con- 
tain all the conserved domains of bHLH-PAS proteins (Extended 
Data Fig. 1b). Our current understanding of quaternary structure in 
this family is based only on the CLOCK-BMALI structure”, which 
also used their bHLH-PAS-A-PAS-B segments. The CLOCK- 
BMALI complex was studied in its DNA-free state, and no ligands 
were available to assess potential pockets. Here, we present a gathering 
of five distinct crystal structures for mouse HIF-1a and HIF-2« 
heterodimers with ARNT, including the HIF-20-ARNT complex 
alone and bound to two different small molecules, as well as the 
structures of both HIF-20-ARNT and HIF-lo-ARNT in their 
DNA-bound states. 


Quaternary architecture of HIF-24-ARNT heterodimer 


We obtained well-diffracting crystals of the HIF-20-ARNT hetero- 
dimer in the apo form, and solved its structure at 2.4A resolution 
(Extended Data Table 1). There are two closely related complexes in 
the asymmetric unit, with a root mean squared deviation (r.m.s.d.) 
of less than 1.0A between all the main-chain atoms. Figure 1b 
shows the overall organization of the HIF-2x-ARNT heterodimer. 
Each domain from both subunits forms domain-domain interac- 
tions to establish a well-integrated quaternary architecture. The 
three domains of ARNT rotate and twist around the outer surface 
of the HIF-2« protein. This form of interaction involves nonequi- 
valent contacts between the two subunits. The asymmetric arrange- 
ment of HIF-2% and ARNT subunits is remarkable, given that each 
partner employs the same three conserved domains (Fig. 1b and 
Extended Data Fig. 2). 
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Figure 1 | Overall architectural features of the HIF-20-ARNT heterodimer 
and its comparison with CLOCK-BMALI. a, Schematic representation 

showing the domain arrangements of ARNT (green), HIF-2« (magenta) and 
HIF-1¢ (blue) proteins. b, Overall structure of the HIF-2x-ARNT heterodimer 


Despite their overall asymmetry, the HIF-2« and ARNT subunits 
bring their respective bHLH segments into convergence to form a 
single DNA-reading head (Fig. la, b). The two bHLH domains 
become linked in a pseudo-symmetric arrangement, ensuring their 
ability to read consensus hypoxia response element (HRE) sequences 
that are also pseudo-symmetric”’. With the DNA-reading head at one 
end of the complex, the opposite end joins the two PAS-B domains in 
an asymmetric fashion (Fig. 1b). The central core of the heterodimer is 
established by the interconnected PAS-A domains from both subu- 
nits. PAS domains may function as the signal recognition modules in 
the bHLH-PAS family**”. 

We separated the two subunits of HIF-2x-ARNT for closer indi- 
vidual inspection, to highlight their important differences (Fig. 1c). 
HIF-2« has its three domains forming mutual contacts to form a 
contiguous surface. But the three ARNT domains have no intra- 
molecular contacts, and are instead rotated and displaced apart 
(Fig. 1c). The lack of physical constraints among the ARNT domains 
could confer adaptability for heterodimerization with many other 
bHLH-PAS members (Extended Data Fig. 1a). All three of the 
ARNT domains become highly constrained by forming intermolecu- 
lar contacts within the HIF-2x-ARNT heterodimer (Fig. 1b, c). 


Comparison to the CLOCK-BMAL] architecture 

ARNT and BMALI are each common dimerization partners for the 
other mammalian bHLH-PAS members®. BMAL1 forms heterodi- 
mers with CLOCK and NPAS2. ARNT has a much larger repertoire 
of partners that includes all three HIF-« proteins, NPAS1, NPAS3, 
NPAS4, AHR, AHRR, SIM1 and SIM2 (Extended Data Fig. 1a). Next, 
we asked whether the HIF-2x-ARNT heterodimer shares the same 
architecture as CLOCK-BMALI. 

We compared the HIF-2x-ARNT and CLOCK-BMALI heterodi- 
mers and their individual proteins (Fig. 1c, d). First we find that while 
ARNT has its three domains separated, BMAL1 has connected PAS-A 
and PAS-B domain surfaces. Then comparing HIF-2« with CLOCK, 
we find different domain-domain arrangements in each protein 
(Fig. 1c, d). The differences among all four proteins are striking, since 
they share conserved and similarly embedded bHLH, PAS-A and 
PAS-B domains (Extended Data Fig. 1b). We next compared HIF- 
2a-ARNT and CLOCK-BMAL1 at the level of heterodimers, finding 
notable differences in their dimerization modes (Fig. 1c, d and 
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in two views, with domains labelled. c, The individual and heterodimeric 
structures of ARNT (green) and HIF-2a (magenta). d, The individual and 
heterodimeric structures of BMAL1 (cyan) and CLOCK (brown). 


Extended Data Fig. 3). Given that each of BMAL1 and ARNT hetero- 
dimerize with non-overlapping subsets of bHLH-PAS proteins, these 
two distinct arrangements may represent how the two subsets of 
family members are architecturally arranged. 

We note that three of the domain-domain interfaces are similarly 
formed within the HIF-2x0-ARNT and CLOCK-BMALI heterodi- 
mers (see interfaces 1, 2 and 4 in Extended Data Fig. 4a, b). But a 
previously described interaction mode observed by NMR” and crys- 
tallography”, involving the isolated PAS-B domains of HIF-2a and 
ARNT, is not seen in our structure (Extended Data Fig. 4a, c). One 
cannot exclude that the previously observed mode of interaction is 
relevant during the assembly of the complex or upon recruitment of 
associated factors. A binding site for coactivator TACC3 on the sur- 
face of ARNT PAS-B domain has also been described recently’’, 
which is not fully accessible within our multi-domain structure 
(Extended Data Fig. 4d). 


Similar domain interfaces within two HIF heterodimers 


We next examined whether the crystallographic observations of 
domain-domain interfaces in HIF-20-ARNT are important for 
establishing heterodimer stability in cells. Figure 2a shows all six 
domain-domain interfaces observed in our crystal structure in detail. 
We probed these interfaces using a mutational strategy directed at the 
amino-acid side chains that form the interfacial contacts. The stabi- 
lities of the heterodimers within HEK293T cells were examined using 
aco-immunoprecipitation strategy that detects ifthe two subunits still 
assemble when the mutations are present (Fig. 2b). 

We found that most single and double mutations introduced at the 
observed domain-domain surfaces produced significantly destabiliz- 
ing effects on the HIF-2a-ARNT heterodimer (Fig. 2b). Mutations at 
interface 1, connecting the ARNT bHLH to the HIF-2x bHLH 
domain, and at interface 2, connecting ARNT PAS-A to the HIF-2« 
PAS-A, clearly compromised heterodimer stability (Fig. 2b). The 
domain connection linking the ARNT PAS-A to the HIF-2% PAS-B 
(interface 3) is a critical part of our structure, but the equivalent 
interface does not form in the CLOCK-BMALI structure”. 
Consistently, single and double mutations introduced at interface 3 
destabilized the HIF-2x0-ARNT heterodimer. Further mutations 
introduced at other domain-domain junctions also reduced hetero- 
dimer stability (Fig. 2b). These observations suggest that the domain 
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Figure 2 | Domain interfaces of the HIF-24-ARNT complex. a, Overall 
structure of HIF-20-ARNT with each domain labelled and domain interfaces 
numbered 1 to 6 (lower centre); and detailed interactions at those interfaces 
in close-up views (surrounding panels). Interfaces 1-4 are formed between 
ARNT (green) and HIF-2« (magenta), while 5 and 6 are intra-molecular 


interfaces observed in the crystal structure are indeed required for 
heterodimerization between HIF-2a and ARNT in cells. 

HIF-1o and HIF-2% have a much closer level of amino acid homo- 
logy to each other than they do to other family members (Extended 
Data Fig. 1b). Our data show that mutations that destabilized HIF- 
2a-ARNT also destabilized HIF-lo-ARNT (Fig. 2b). We infer from 
these data that the two heterodimers have closely related architec- 
tures. Our subsequent structural observations of both HIF-la-ARNT 
and HIF-20-ARNT in their DNA-bound states confirm this notion 
(see below). 


Distinct pockets for small-molecule binding 


Among the mammalian bHLH-PAS family members, AHR is known 
to function physiologically as a sensor of small molecules, binding to a 
diverse set of xenobiotic and endogenous molecules”. Another mem- 
ber, NPAS2, has been reported to be gas responsive and bind haem as 
a prosthetic group”. It is not known whether other members of this 
family have natural ligands. PAS domains in unrelated signal trans- 
duction protein families are known to act as versatile molecular sen- 
sors, to detect physical, chemical and environmental signals*”*’. 

Although no endogenous ligand has been identified for the HIF-« 
or ARNT proteins, a number of synthetic small molecules were dis- 
covered through screening campaigns*°*°. Ligands that reduce the 
activity of HIF-« proteins are sought as potentially useful therapies 
in cancers where HIF-« overexpression correlates inversely with 
patient survival’. The possibility that HIF-« proteins contain a 
pocket was initially motivated by the discovery of a large cavity filled 
with eight ordered water molecules in the isolated PAS-B domain 
from HIF-2« (refs 26, 27, 37). 

We directly investigated the binding modes of two previously 
described molecules (0X3 and acriflavine) within the HIF-2%- 
ARNT heterodimer using X-ray crystallography (Fig. 3a-e). The bind- 
ing site for acriflavine had not been known. However, acriflavine acts 
through the HIF-« proteins to inhibit tumour vascularization and 
mobilization of angiogenic cells into peripheral blood**. For 0X3, 
the binding site within the isolated PAS-B domain of HIF-2«% was 
previously examined™, and we asked whether its binding pocket is 
shared by acriflavine. 

Acriflavine contains a mixture of two structurally related compo- 
nents, proflavine and trypaflavin**. We began by showing that both 
components bind to HIF-1a and HIF-2« heterodimers with ARNT 
with similar affinities (Extended Data Fig. 5a, b). We obtained crystals 
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junctions in HIF-2«. b, Co-immunoprecipitation (co-IP) experiments showing 
the effects of ARNT mutations at interfaces 1-4 on the stability of ARNT 
heterodimers with HIF-2« or HIF-1o. Interface 5 was disrupted by mutations 
in HIF-1o or HIF-2« at their PAS-A domains. Full blot scans are shown in 
the Supplementary Fig. 1. 


of HIF-2%-ARNT with each of 0X3 and proflavine, then solved and 
refined these structures at ~2.8 A resolution (Extended Data Table 1). 
The electron density maps in Fig. 3b-e allowed us to visualize their 
binding sites unambiguously. 

We observed that 0X3 and proflavine use distinct binding pockets 
within the architecture (Fig. 3a). 0X3 becomes fully encapsulated 
within the PAS-B domain of HIF-2« (Fig. 3b, c). Therefore, the qua- 
ternary architecture of the heterodimer formed no apparent barriers 
for its access to that site. Proflavine binds to an unexpected site. It 
intercalates between the HIF-2«% PAS-B domain and the ARNT PAS- 
A domain (Fig. 3d, e). The creation of pockets between PAS domains 
had not been anticipated. Moreover, the binding site of proflavine is at 
the critical convergence point of these two domains (interface 3, 
Figs 2a and 3d, e). 

Amino acid residues R266 and V305 of ARNT form direct contacts 
with proflavine (Fig. 3e), but are also critical for maintaining the 
integrity of interface 3 (Fig. 2a). Our mutagenesis studies in Fig. 2b 
showed that this site is a critical node required for establishing stable 
HIF-a-ARNT heterodimers. Accordingly, the binding of acriflavine 
was reported to destabilize HIF-x-ARNT heterodimers in cells**. 
A previous study showed that acriflavine can also bind to each of 
the individual HIF-10 and HIF-2% subunits with a dissociation con- 
stant (Ka) of ~50 uM**. This binding is substantially weaker than the 
Ka of ~40nM we observed for binding to the HIF-la-ARNT and 
HIF-20%-ARNT heterodimers (Extended Data Fig. 5b). Nevertheless, 
it remains possible that acriflavine binding can occur at a second 
independent site on the HIF-o subunit alone. The PAS-B domain 
in HIF-« is another critical node for establishing the stability of het- 
erodimers, as it firmly interacts with two PAS domains from ARNT, 
as well as HIF-«’s own PAS-A domain (Figs 2a and 3a). We show in 
Extended Data Fig. 5c a plausible mechanism through which ligand 
binding to the HIF-2x PAS-B domain can influence heterodimer 
stability in the HIF-20-ARNT complex. 

We examined whether the amino acid residues in the PAS-B pock- 
ets of HIF-2« and HIF-1« are strictly conserved, or sufficiently dif- 
ferent to allow small-molecule discrimination (Extended Data Fig. 5d, 
e). Three 0X3-interacting residues differ in these two pockets: A277, 
$292 and $304 in HIF-2« are replaced by 1275, T290 and T302 (which 
have bulkier side chains) in HIF-1«. As previously suggested’, these 
differences cause a smaller cavity to form in HIF-1o, explaining why 
0X3 binds to the HIF-2« PAS-B domain selectively. Another small 
molecule, THS-044, also shows selective binding to the HIF-20 PAS-B 
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domain”’. However, the observed proflavine pocket uses a nearly 
identical set of amino acids within HIF-2x-ARNT and HIF-1lo- 
ARNT heterodimers (Extended Data Fig. 5f, g). Consistent with this 
observation, acriflavine was found to interact with both HIF forms*’, 
which we confirmed biochemically (Extended Data Fig. 5b). 

We also looked elsewhere for accessible small-molecule binding 
pockets in the HIF-20-ARNT architecture. As previously reported”, 
the PAS-B domain of ARNT contains another available cavity 
(Fig. 3f, g). The small molecule KG-548 has been shown to bind 
there, inhibiting its interactions with the coactivator TACC3**. Only 
some residues are conserved between the ARNT and HIF-2«% PAS-B 
pockets. Consistent with this observation, KG-548 binds only to 
ARNT, but not to HIF-2« (ref. 36). The PAS-B pocket of ARNT 
represents a third distinct pocket in the heterodimer for small- 
molecules binding. 

In addition, we also found two other accessible cavities within the 
HIF-20%-ARNT complex. These are observed in the PAS-A domains 
of each of HIF-2«% and ARNT proteins (Fig. 3h-k). It was reported 
previously that the NPAS2 protein binds haem through both its PAS- 
A and PAS-B domains”’, and some proteins outside the bHLH-PAS 
family use their PAS-A domains for ligand binding*®*'. Importantly, 
only about 60% of residues embracing the HIF-2« PAS-A pocket are 
identical with HIF-1«. Therefore, this pocket may offer selective bind- 
ing to ligands between the two HIF-« proteins, just as with their PAS- 
B pockets. The volumes of all the PAS pockets of HIF-2« and ARNT 
are shown in Extended Data Fig. 5h. These pockets can potentially 
bind endogenous molecules or synthetic compounds. The discovery 
of five distinct pockets in these complexes expands the possibilities for 
targeting HIF protein activities with drug-like molecules for thera- 
peutic purposes. 
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Binding to hypoxia response elements 

To understand how HIF-la-ARNT and HIF-20%-ARNT complexes 
recognize HREs, we prepared segments consisting of the bHLH-PAS- 
A-PAS-B domains from HIF-1a, HIF-2% and ARNT, together with 
synthetic DNA duplex containing the core sequence of HRE (5’- 
TACGTG-3’). Crystal structures for HIF-lo-ARNT and HIF-20- 
ARNT complexes with DNA were solved and refined at 3.9 A and 
3.6 A resolutions, respectively (Extended Data Table 1). 

These structures confirm that the overall architectures of two 
HIF-a-ARNT heterodimers are indistinguishable (Fig. 4a, b), as 
our mutagenesis studies suggested (Fig. 2b). Moreover, HIF-lo- 
ARNT and HIF-20-ARNT engage their response elements in an 
identical fashion, using specific amino acids located on their 
bHLH domains. These domains insert «-helices into the major 
groove faces of the TACGTG sequences in a pseudo-symmetric 
fashion, allowing base-pair-specific hydrogen bonds and van der 
Waals interactions to form with residues from the bHLH helices 
(Extended Data Fig. 6a, b). The ARNT residues forming DNA con- 
tacts (R102, E98 and H94) were identically positioned in BMAL1 to 
recognize the GTG half site, as revealed by the crystal structure of 
isolated CLOCK-BMAL1 bHLH domain with E-box (5'-CACGTG- 
3') DNA”*®. We further verified the importance of these residues in 
HIF-20-ARNT by introducing mutations and detecting reduced 
DNA binding as a result (Extended Data Fig. 6c). 

A comparison of the DNA-bound and DNA-free forms of HIF-2a- 
ARNT shows important structural adaptations. As seen in Fig. 4c, the 
a1 helices in both HIF-2% and ARNT bHLH domains extend in size 
to bind to DNA. The «2 helices, which link the bHLH domains to 
the remainder of the quaternary architecture, undergo rotations to 
further facilitate DNA binding. The extension and rotation of the 
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Figure 4 | DNA-bound HIF-a—ARNT structures. a, b, Overall view of the 
HIF-2x-ARNT-DNA (a) and HIF-la-ARNT-DNA (b) complexes. 
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heterodimers (left); enlarged figure shows the dramatic conformational 
changes at the bHLH domains that accompany DNA binding (right). d, The 
interactions between HIF-2~ PAS-A domain and DNA (HRE site in blue) are 
shown in an overview (left) and in a close-up fashion (right). 


DNA-reading head enhances the heterodimer’s ability to engage the 
DNA response element optimally. 

In both heterodimers, we also found that the PAS-A domain of 
HIF-« cooperates with the bHLH domains to establish DNA binding 
(Fig. 4d). Mutations introduced at the DNA-binding surface of the 
HIF-2% PAS-A domain impacted DNA affinity (Extended Data Fig. 
6c, d). The PAS-A domain contacts DNA at a distance six base pairs 
away from the hexameric core element (Fig. 4d and Extended Data 
Fig. 6b). This cooperation between the PAS-A and bHLH domains 
markedly extends the heterodimers’ footprints on DNA. PAS 
domains had not been known to be capable of forming DNA inter- 
actions. Specifically positioned amino acids within the PAS-A 
domains of ARNT’s partners may modulate the DNA-binding affin- 
ities of other bHLH-PAS heterodimers. 


Mapping cancer mutations and phosphorylation sites 


There is strong evidence for altered HIF functions in many human 
cancers’*. Germline mutations in the von Hippel-Lindau (VHL) 
tumour suppressor gene, encoding a HIF-« E3 ubiquitin protein 
ligase, cause a dominantly inherited familial cancer syndrome”. 
The resulting mutational inactivation of VHL protein leads to 
the accumulation of HIF-1a and HIF-2« proteins*!’. VHL disease 
is characterized by susceptibility to retinal and central nervous sys- 
tem haemangioblastomas, clear cell renal cell carcinomas (ccRCC), 
pancreatic islet cell tumours and renal, pancreatic and epididymal 
cysts**. 

The HIFIA gene is highly overexpressed in a number of cancers 
including colon, breast, gastric, lung, skin, ovarian, pancreatic, pro- 
state and renal cancers as a result of intratumoral hypoxia’’. In the 
case of renal cancer, recent genetic and functional studies also impli- 
cated HIF-1o as a tumour suppressor gene**. Loss of HIF-1o express- 
ion was found to be associated with worse survival rate in patients 
with ccRCC*. Changes in expression levels of HIF-« proteins 
present in a wide spectrum of cancers occur through a multitude of 
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mechanisms, including hypoxia, loss-of-function of tumour sup- 
pressor genes, activating mutations in oncogenes, and activation of 
kinase signalling pathways”. 

Missense mutations within each of HIF-1o and HIF-2« proteins 
have been isolated from different cancer biopsies***’. Using the 
COSMIC (Catalogue of Somatic Mutations in Cancer) database**, 
we mapped 19 such mutations that locate to the bHLH-PAS- 
A-PAS-B segments of these proteins (Extended Data Fig. 7a). These 
mutations are linked to cancers including stomach adenocarcinomas, 
endometrial carcinomas, brain gliomas, lung adenocarcinomas, hepa- 
tocellular carcinomas and skin melanomas. 

Extended Data Figure 7 shows the positions of these mutations 
within both heterodimers. One group of mutations maps directly to 
sites that establish DNA binding. A second group maps to the domain 
interfaces, while the rest locate to the pockets in PAS domains. 
Notably, some mutations at PAS-B domains specifically alter the 
interior pockets where we observed 0X3 binding. These include resi- 
dues H248 and A277 inside the HIF-2% pocket (Extended Data Fig. 
7b), which contact 0X3 (Fig. 3c and Extended Data Fig. 5d). M2501 
alters the equivalent PAS-B pocket within HIF-la (Extended Data 
Figs 5e and 7c). The HIF-1% mutation V116E identified from ccRCC 
patients has been shown to impair HIF-1o activity”. As shown in 
Extended Data Fig. 7c, V116 is positioned on an o-helix (Ca) in the 
PAS-A domain, where its hydrophobic side chain is required to sta- 
bilize this domain. Phosphorylation can also regulate HIF-« activity 
on some target genes’. HIF-1o0 and HIF-2a are phosphorylated in 
their PAS-B domains*”*°. We examined these sites within the hetero- 
dimeric complexes (Extended Data Fig. 8). 


Concluding remarks 


Hypoxia in solid tumours contributes to aggressive phenotypes 
and accounts for the failure of many current therapies’. The HIFs are 
versatile transcription factors responsible for progression of hypoxia- 
induced tumours'’. HIF-regulated programs drive erythropoiesis, 
angiogenesis and the metabolic switch to glucose utilization for fuel. 
The HIFs further drive drug efflux, cell proliferation and anti-apoptosis 
programs’’. The coordination of all these programs helps to promote 
tumour growth, drug resistance and ultimately leads to lower patient 
survivability’’. As many classic chemotherapy agents fail to impact 
hypoxic cancers, novel strategies are needed to target tumour growth 
under hypoxic conditions”. Targeting of the HIF-10 and HIF-2« pro- 
teins with a broad variety of therapeutic small-molecule modulators 
should be possible given the five distinct pockets visualized here. As the 
HIF pathways are also critical for driving a number of other diseases in 
humans, including anaemia and cardiac ischaemia, the discovery of 
drug-binding pockets opens the possibility for therapeutic approaches 
directed at the HIF proteins themselves, in place of the HIF-modifying 
PHD enzymes. 

Other ligand-responsive mammalian bHLH-PAS family members 
include AHR™ and NPAS2”. These factors bind to environmental 
and cellular ligands. The HIF-«% proteins may also recognize endo- 
genous ligands that provide cues about the environmental or meta- 
bolic status of cells. The observation that several cancer-related 
missense mutations map to these pockets suggests that endogenous 
ligand(s) are likely to exist. Identifying these natural ligands and their 
effects on the heterodimers would further propel our understanding 
of HIF functions and drive the discovery of small-molecule modula- 
tors. It is intriguing that some pockets form at critical nodes that 
establish heterodimer stability (Extended Data Fig. 9). Protein 
stability is already a well-established mechanism by which the HIF- 
proteins are regulated physiologically’. 

Finally, we point to the differences between the HIF-x-ARNT and 
CLOCK-BMALI architectures as being indicative that a single mode 
of heterodimerization is not shared by all mammalian bHLH-PAS 
members. HIF-x-ARNT and CLOCK-BMALI heterodimers 
employ uniquely arranged domain-domain junctions to establish 
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their stabilities. Their integrated subunits and domain arrangements 
provide the pathways through which signals may be communicated 
allosterically across domains. These signals can include endogenous 
ligands, disease mutations and post-translational modifications. 
Future studies that explore the allosteric pathways in these hetero- 
dimers can also advance the discoveries of therapeutic molecules. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. 

Plasmid construction and site-directed mutagenesis. For the protein overex- 
pression in Escherichia coli, mouse ARNT (GenBank accession AAH12870.1, 
residues 82-464) was cloned into the vector pMKH as previously described*'. 
Meanwhile, mouse HIF-2« (GenBank AAH57870.1, residues 3-361) and HIF-10 
(GenBank AAH26139.1, residues 13-357) were cloned into the vector pSJ2, 
respectively. For the co-immunoprecipitation, full-length ARNT was cloned into 
the pCMV-Tag]l vector (C-terminal Flag-tagged), while full-length HIF-20 and 
HIF-1a% were cloned into the pCMV-Tag4 vector (C-terminal Myc-tagged), 
respectively. Site-directed mutagenesis was performed as previously”’, and con- 
firmed by DNA sequencing. 

Protein expression and purification. The recombinant plasmids pSJ2-HIF-2 
and HIF-1o were respectively co-transformed along with pMKH-ARNT into BL21 
competent cells (Agilent Technologies). Following the IPTG-induced overnight 
expression at 16 °C, cell pellets were lysed by sonication, and supernatant samples 
after centrifugation were applied onto the packed His-Bind resin (Novagen). The 
bound proteins were further purified by an SP Sepharose column (GE Healthcare), 
and the eluted fractions were loaded on a Superdex 200 pg gel filtration column. To 
prepare protein-DNA complexes, 21mer double-strand DNA (forward: 5’- 
GGCTGCGTACGTGCGGGTCGT-3’ and reverse: 5’-CACGACCCGCACGTA 
CGCAGC-3’) was mixed with the heterodimeric proteins at a molar ratio of 
1.2:1. Then the mixture was loaded again on the gel filtration column in the running 
buffer containing 20mM Tris pH 8.0 and 150mM NaCl. Peak fractions of the 
protein-DNA complex were pooled and supplemented with 10 mM DTT. 
Crystallization and X-ray data collection. Crystallization of the HIF-20-ARNT 
complex was carried out using the sitting-drop vapour diffusion method at 16 °C, 
by mixing equal volume of protein (4 mg ml ') and reservoir solution containing 
2% tacsimate pH 7.0, 6% PEG3350. Before being flash frozen in liquid nitrogen, 
crystals were soaked in reservoir plus 30% PEG400 as the cryoprotectant. HIF- 
20-ARNT-0X3 (antagonist 2, Sigma SML0883) complex was co-crystallized in a 
similar condition with 100 uM compound added to the protein solution. HIF-2%- 
ARNT-proflavine crystals were obtained by soaking apo crystals in the drops 
containing 100 iM acriflavine (Sigma A8126) overnight. HIF-20-ARNT-DNA 
crystals were grown in sitting drops formed by equal volume of complex 
(4mg ml ') and reservoir consisting of 4% tacsimate pH 6.0, 10% PEG3350, 
and then transferred stepwise to cryoprotectant containing up to 30% PEG 400 
before being flash frozen. HIF-1la-ARNT-DNA crystals were grown also at 16 °C 
in hanging drops, with the reservoir containing 100 mM MES pH 6.0, 100mM 
CaAc, and 15% PEG 400. Before being flash frozen, these crystals were air 
dehydrated by serially transferring to 40 pl drops of reservoir with increasing 
concentrations of PEG 400 (up to 40%) for about 10min at each step. 
Diffraction data were collected at the Argonne National Laboratory SBC-CAT 
19ID beamline at 100 K, and processed using the HKL3000 program>. 
Structure determination and refinement. The structure of HIF-2x-ARNT com- 
plex was solved by molecular replacement with the program Phaser”, using 
modified single domains from coordinates 4F3L”, 3F1P”” and 4M4X°! as 
the search models. Further manual model building was facilitated by using 
Coot™, combined with the structure refinement using phenix.refine® and the 
PDB_REDO web server®. The rest of the structures were then solved by molecu- 
lar replacement using HIF-2%-ARNT complex as the starting model, and further 
refined in the similar way. The diffraction data and final refinement statistics are 
summarized in Extended Data Table 1. The Ramachandran statistics, calculated 
by Molprobity”’, are 96%/0, 92%/0, 93%/0, 88%/0.35%, 82%/0.41% (favoured/ 
outliers) for the HIF-20-ARNT apo, HIF-2x0-ARNT-0X3, HIF-2a-ARNT-pro- 
flavine, HIF-2x-ARNT-DNA and HIF-la-ARNT-DNA complexes, respect- 
ively. All the structural figures were prepared using PyMOL (The PyMOL 
Molecular Graphics System, Version 1.3, Schrédinger, LLC). 
Co-immunoprecipitation. Similarly to our previous work°', HEK293T cells 
(ATCC CRL-3216, not authenticated nor tested for mycoplasma contamination) 


ARTICLE 


were seeded in 10 cm dishes and one day later transfected with 2 1g pCMV-Tag4- 
HIF-2« or HIF-la (WT or mutants) and 641g pCMV-Tagl-ARNT (WT or 
mutants) plasmids using 16 pl jetPRIME regent (Polyplus-transfection). After 
overnight incubation, medium was refreshed with 50 1M DP (2,2’-dipyridine, 
Sigma D216305). Another 24h later cells were harvested and shortly sonicated in 
700 ul lysis buffer (1X TBS with 1mM EDTA, 1% TRITON X-100 and 1x 
protease inhibitor cocktail) before quick spinning. After the protein concentra- 
tion measurement for each sample, 40 1g of supernatant was saved as input for 
western blot using monoclonal anti-Flag M2 antibody (Sigma F1804) and anti- 
Myc rabbit mAb (Cell Signaling 2278). Immunoprecipitation was performed with 
1 mg of supernatant (diluted to 1 ml with lysis buffer) and 40 ul of anti-Flag M2 
affinity gel suspension (Sigma A2220) according to the manufacturer’s instruc- 
tions, followed by western blot using the anti-Myc antibody. 

Compound binding assay. The intrinsic fluorescence signals of 10 nM acriflav- 
ine, proflavine (Sigma 131105) or trypaflavin (isolated from acriflavine by HPLC) 
were monitored using the BioTek Synergy 4 microplate reader (445 nm excita- 
tion/515 nm emission) in black 96-well plates. Purified HIF-2%-ARNT or HIF- 
la-ARNT protein complexes were mixed with compounds in buffer containing 
20 mM Tris pH 8.0 and 400 mM NaCl. The fluorescence quenching by gradually 
increased protein concentration was normalized to fraction bound, and the Kg 
values were calculated by fitting the curves in GraphPad Prism 6. 

Fluorescence polarization HRE DNA binding assay. The fluoresceinated dou- 
ble-strand HRE DNA was prepared by annealing 6-FAM labelled forward strand 
(5'-GGCTGCGTACGTGCGGGTCGT-3’) with the unlabelled reverse strand 
(5'-ACGACCCGCACGTACGCAGCC-3’) in the buffer consisting of 20 mM 
Tris pH 7.5, 1mM EDTA and 2mM MgCh. For the binding assay, 2nM DNA 
was incubated with purified complex proteins for 30 min, and final protein con- 
centrations were varied by serial dilution in binding buffer (20 mM Tris pH 8.0, 
50mM NaCl). The fluorescence polarization signals were recorded using black 
96-well plates on FlexStation 3 (Molecular Devices). The data were converted to 
fluorescence anisotropy, and K, values were calculated as above. 
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Extended Data Figure 1 | The mammalian bHLH-PAS family of transcrip- 
tion factors. a, Diagram showing the partner choices for heterodimerization 
between members of this family. ARNT is ubiquitously expressed, while 


ARNT2 (paralogue of ARNT) is mainly expressed in the central nervous 
system®’. b, Sequence identity comparison among the same domains from 
mouse ARNT, HIF-1a, HIF-2«, HIF-3a, CLOCK and BMALI proteins. 
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Extended Data Figure 2 | Structure of the HIF-2a-ARNT complex in cartoon mode. Overall structure of HIF-24-ARNT complex (centre), and individual 
domains of ARNT (top) and HIF-2« (bottom) with their secondary structures labelled. 
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HIF-2a:ARNT vs CLOCK:BMIAL1 


Extended Data Figure 3 | Comparison of the overall structures of HIF-20- | BMAL1, respectively. Colours used in labels match those used in figures for the 
ARNT and CLOCK-BMALI complexes. Two complexes are superposed same components. 
by aligning the bHLH domains (a) or PAS-B domains (b) of ARNT and 
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Extended Data Figure 4 | Domain interfaces of the HIF-20-ARNT complex 
and the comparison with other structures. a, Overall structure of HIF-20- 
ARNT complex with domain interfaces numbered (left), and the spatial 
arrangement of each interface (right). b, The dimer interfaces formed by 
equivalent domains in the CLOCK-BMAL1 complex, corresponding to the 
interfaces 1, 2 and 4 of HIF-2x-ARNT heterodimer. c, The B-sheet-mediated 
antiparallel interface of isolated HIF-2x-ARNT PAS-B complex with 
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ARNT:TACC3 vs HIF-20:ARNT 
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Interface 5 
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Interface 3 
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Interface 4 
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Interface 6 
PAS-A bHLH 
PAS-B PAS-B c PAS-B PAS-B 


HIF-20™t: ARNTmut 
(PDB ID 3F1P) 


PAS-B 


bHLH 


mutations. d, The crystal structure showing the interaction between ARNT and 
TACCS3 (left), and its superimpositions with HIF-20-ARNT complex 
through the PAS-B domain of ARNT in two views (right). Colours used in 
labels match those used in figures for the same components. The binding 
position of TACC3 peptide with respect to ARNT PAS-B cannot be fully 
accommodated in our quaternary structure, as some steric clashes would result 
involving TACC3 and the HIF-2% PAS-B domain. 
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Extended Data Figure 5 | Ligand binding sites in the HIF-24-ARNT 


complex. a, Acriflavine is a mixture of proflavine and trypaflavin. b, Binding 
tests of acriflavine, proflavine and trypaflavin to protein complexes of HIF-20- 


ARNT (left) and HIF-1a-ARNT (right). The calculated Ky values for 


acriflavine, proflavine and trypaflavin were 41 nM, 31 nM and 34 nM for HIF- 
2a-ARNT, and 40 nM, 29 nM and 56 nM for HIF-1¢-ARNT, respectively. 
Representative data of at least two experiments, shown as mean = s.d. from 
three technical replicates. c, A proposed mechanism by which 0X3 binding can 
destabilize the HIF-2«a-ARNT heterodimer. Shown is the proximity of 0X3 
within the PAS-B domain of HIF-2« and R366 from the PAS-B domain of 
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ARNT. 0X3 binding can potentially influence domain-domain interactions 
mediated by R366. We showed in Fig. 2b that R366 is a highly sensitive site for 
maintaining the stability of HIF-2x-ARNT heterodimer. 0X3 binding to the 
PAS-B domain of HIF-2« could further influence interfaces 3, 4 and 5 (shown 
in Extended Data Fig. 4a). d-g, 0X3 (d) binds at the inner pocket of HIF-20 
PAS-B domain, while proflavine (f) binds at the interfacial pocket formed by 
ARNT PAS-A and HIF-2« PAS-B domains; corresponding positions in the 
structures of HIF- 1a PAS-B (e) and HIF-1a-ARNT-DNA complex (g) are also 
shown. The yellow dotted lines represent hydrogen bonds. h, Pocket volumes 
calculated with CASTp program® at default 1.4 A probe radius. 
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Extended Data Figure 6 | Interactions between the HIF-2a-ARNT complex 
and DNA. a, Recognition of the HRE site by the bHLH domains. Overall 
structure of HIF-2x-ARNT-DNA with each domain labelled (left); and 
detailed interactions between the core HRE site (blue) and the bHLH domains 
of HIF-2 (upper right) or ARNT (lower right) are shown, with grey meshes 
showing 2F, — F_ electron density contoured at 0.80. Hydrogen bonds (2.5— 
3.5 A) are indicated by the brown dotted lines, while hydrophobic contact 
(3.6 A) is shown by the green one. b, Schematic recognition diagram of HIF- 
2a-ARNT to the HRE sequence (blue) on DNA. The brown and green dotted 
lines represent hydrogen bonds and hydrophobic contact, respectively. The 
black arrow indicates the nucleotide interacting with residues N184 and K186 
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from HIF-2% PAS-A domain. Additional basic residues from HIF-2« and 
ARNT bHLH domains (HIF-2« K16, K18, R20, R24, R26 and ARNT R91, R99, 
R101) that can interact with DNA (mainly through the phosphate backbone) 
are also labelled in Extended Data Fig. 9. c, HRE DNA binding assay of HIF-2a- 
ARNT protein complex in wild type (WT) or point-mutated forms using 
fluorescence polarization. Representative data of at least two experiments are 
shown as mean + s.d. from three technical replicates. Calculated approximate 
Kg values are shown in parentheses. d, The interactions between HIF-2a 
PAS-A domain and DNA, with the GH loop (including residues N184 and 
K186) and interacting nucleotides meshed by 2F, — F. map at 0.80. 
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a 
Location Possible Role Primary Tissue (Subtype) Associated Histology 
HIF-2a 
K18E  bHLH ai DNA interaction Stomach Adenocarcinoma 
A23V bHLH at DNA interaction Endometrium Endometrioid carcinoma 
V47M — bHLH a1-a2 loop Interface 6 (bHLH/PAS-A) Central nervous system (brain) Glioma 
F98L PAS-AAB Internal stability Large intestine (colon) Adenocarcinoma 
R166L PAS-AGB Internal stability Kidney Clear cell renal cell carcinoma 
1223M_ PAS-AIB Interface 2 (PAS-A/PAS-A) Lung Adenocarcinoma 
H248N PAS-BAB Internal stability Large intestine (colon) Adenocarcinoma 
R275H PAS-B Da-Ea loop _ Internal stability Cervix Squamous cell carcinoma 
A277P PAS-B Ea Internal stability Lung Squamous cell carcinoma 
E279V PAS-BEa Internal stability Liver Hepatocellular carcinoma 
HIF-1a 
K19Q_ bHLH at DNA interaction Endometrium Endometrioid carcinoma 
R30Q_ bHLH at DNA interaction Skin Malignant melanoma 
L541 bHLH a1-a2 loop Interface 6 (bHLH/PAS-A) Kidney Clear cell renal cell carcinoma 
V116E PAS-ACa Internal stability Kidney Clear cell renal cell carcinoma 
M120T PAS-ACa Internal stability Large intestine (colon) Adenocarcinoma 
M1711 PAS-AGB Internal stability Kidney Clear cell renal cell carcinoma 
M2501 PAS-BA£-BB loop Internal stability Lung Adenocarcinoma 
L262S PAS-BCa Internal stability Skin Malignant melanoma 
V3411 PAS-B Ip. Internal stability Endometrium Endometrioid carcinoma 
c 


= 


PAS-B 


PAS-B 


PAS-A- PAS-A 


K19Q 


HIF-2a:ARNT:DNA HIF-10:ARNT:DNA 


publications**°*™. b, c, Spatial distribution of the HIF-20 (b) and HIF-la 

(c) mutations in the heterodimers. Arrows point to close-up views of several 
regions in each heterodimer. The sequence positions of these mutations are also 
labelled in Extended Data Fig. 9. 


Extended Data Figure 7 | Locations of cancer-related missense mutations 
on the HIF-a-ARNT heterodimers. a, Detailed information of the selected 
cancer-related mutations in HIF-2o and HIF-1«. The information about tissue 
and histology was adopted from the COSMIC database* and other 
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Extended Data Figure 8 | Spatial positions of phosphorylation sites at the 
HIF-a PAS-B domains in the context of HIF-a-ARNT complexes. The 
phosphorylation sites T324 of HIF-2« (a) and $247 of HIF-1« (b) are shown as 
yellow sticks. Their corresponding residues T322 of HIF-1o% and S249 of HIF- 
2a are also shown. Colours used in labels match those used in figures for the 
same components. HIF-2« is phosphorylated at residue T324 by protein kinase 
D1, but the equivalent residue in HIF-1% (T322) cannot be phosphorylated”. 


HIF-10:ARNT:DNA 


The differential positioning of their threonine residues next to a non-conserved 
loop between PAS-A and PAS-B domains may explain why they cannot be 
similarly phosphorylated. Casein kinase 1 (CK1) can phosphorylate HIF-1o 
at S247 (ref. 50). We predict that the equivalent residue $249 in HIF-2% may 
also be the target of CK1 (ref. 9), since the local environments for these residues 
are indistinguishable, with both residues being solvent accessible. 
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Extended Data Figure 9 | Comparison of mouse ARNT, BMAL1, CLOCK, __ labelled above (for ARNT) or below (for HIF-2«) the sequences. In addition, 
HIF-1a and HIF-2a proteins. Sequence alignment of these five proteins the HIF-2o (magenta) or HIF-1o (blue) residues with cancer-related mutations 
includes the bHLH, PAS-A and PAS-B domains. Alignment was conducted by _ (mainly selected from the COSMIC database**) and phosphorylation sites at 
ClustalW2® and then processed by ESPript 3.0%. The secondary structures of | the PAS-B domains are also labelled, further below the secondary structures of 
each domain of HIF-2x-ARNT complex, and residues involved in domain HIF-2a. 

interfaces, protein-DNA or protein-compound interactions, are differently 
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Extended Data Table 1 | Data collection and refinement statistics 


ARTICLE 


HIF-2a:ARNT HIF-2a:ARNT: HIF-2a:ARNT: HIF-2a:ARNT: HIF-1a:ARNT: 
apo 0X3 Proflavine DNA DNA 
Data collection 
Space group P 1 P14 P14 P 43 P 43 
Cell dimensions 
a, b, c (A) 49.0, 76.5, 98.8 49.1, 76.3, 98.0 48.9, 76.4, 98.3 65.7, 65.7, 245.9 66.7, 66.7, 243.6 
a, B,v (°) 90.1, 90.2, 73.1 89.9, 90.0, 73.2 89.8, 89.6, 73.2 90, 90, 90 90, 90, 90 
Resolution (A) 50.0-2.35 (2.39- 50.0-2.87 (2.92- 50.0-2.80 (2.85- 50.0-3.60 (3.66- 50.0-3.90 (3.97- 
2.35)* 2.87) 2.80) 3.60) 3.90) 
Reym OF Rmerge 6.1 (70.0) 12.1 (74.7) 5.4 (69.3) 9.7 (72.2) 4.5 (79.0) 
ol 11.8 (1.8) 8.9 (1.3) 17.0 (1.6) 12.9 (2.1) 24.7 (1.8) 
Completeness (%) 98.4 (97.4) 97.9 (91.1) 98.5 (97.6) 99.5 (99.5) 99.1 (100) 
Redundancy 2.6 (2.5) 2.5 (2.2) 2.4 (2.2) 3.7 (3.5) 3.8 (3.9) 
Refinement 


Resolution (A) 


No. reflections 
Rworks Free 


No. atoms 
Protein/DNA 
Ligand 
Water 

B-factors 
Protein/DNA 
Ligand 
Water 

R.m.s deviations 
Bond lengths (A) 
Bond angles (°) 


49.4-2.36 (2.40- 
2.36) 

55769 (2688) 
20.7/22.8 
(33.9/37.2) 


8924 
0 
139 


0.011 
1.40 


49.0-2.87 (2.96- 
2.87) 

30376 (2510) 
22.2/25.3 
(34.9/39.7) 


8695 
21 


49.2-2.79 (2.88- 
2.79) 

31508 (1754) 
18.8/21.5 
(24.8/29.3) 


32.9-3.59 (3.95- 
3.59) 

12131 (2992) 
25.4/32.3 
(30.3/35.2) 


5624 


29.8-3.90 (4.11- 
3.90) 

9578 (1356) 
26.2/34.2 
(38.7/43.9) 


5042 


One crystal was used for each structure. 


*Highest resolution shells are shown in parentheses. 
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The ubiquitin kinase PINK1 recruits 
autophagy receptors to induce mitophagy 


Michael Lazarou'+*, Danielle A. Sliter!, Lesley A. Kane", Shireen A. Sarraf', Chunxin Wang’, Jonathon L. Burman’, 
Dionisia P. Sideris', Adam I. Fogel! & Richard J. Youle! 


Protein aggregates and damaged organelles are tagged with ubiquitin chains to trigger selective autophagy. To initiate 
mitophagy, the ubiquitin kinase PINK1 phosphorylates ubiquitin to activate the ubiquitin ligase parkin, which builds 
ubiquitin chains on mitochondrial outer membrane proteins, where they act to recruit autophagy receptors. Using 
genome editing to knockout five autophagy receptors in HeLa cells, here we show that two receptors previously 
linked to xenophagy, NDP52 and optineurin, are the primary receptors for PINK1- and parkin-mediated mitophagy. 
PINK1 recruits NDP52 and optineurin, but not p62, to mitochondria to activate mitophagy directly, independently of 
parkin. Once recruited to mitochondria, NDP52 and optineurin recruit the autophagy factors ULK1, DFCP1 and WIPII to 
focal spots proximal to mitochondria, revealing a function for these autophagy receptors upstream of LC3. This supports 
anew model in which PINK1-generated phospho-ubiquitin serves as the autophagy signal on mitochondria, and parkin 
then acts to amplify this signal. This work also suggests direct and broader roles for ubiquitin phosphorylation in other 


autophagy pathways. 


Selective autophagy clears intracellular pathogens and mediates cel- 
lular quality control by engulfing cargo into autophagosomes and 
delivering it to lysosomes for degradation. Autophagy receptors bind 
ubiquitinated cargo and LC3-coated phagophores to mediate auto- 
phagy'*. Damaged mitochondria are removed by autophagy after 
activation of the kinase PINK] and the E3 ubiquitin ligase parkin 
(encoded by PARK2 in humans)**. After the loss of mitochondrial 
membrane potential or the accumulation of misfolded proteins, 
PINK] is stabilized on the outer mitochondrial membrane’, where 
it phosphorylates ubiquitin at Ser65 to activate parkin ubiquitin ligase 
activity’. Although the autophagy receptors p62 and optineurin 
(OPTN) have been shown to bind ubiquitin chains on damaged mito- 
chondria, their roles, and the roles of the other autophagy receptors in 
mediating mitophagy, are unclear*. 


Autophagy receptors in mitophagy 

To clarify autophagy receptor function during mitophagy, genome 
editing was used to knock out five autophagy receptors (designated 
pentaKO) in HeLa cells, which do not express endogenous parkin. 
DNA sequencing (Supplementary Table 1) and immunoblotting of 
TAXIBP1, NDP52 (also known as CALCOCO2), NBRI, p62 
(SQSTM1) and OPTN (Fig. 1a, lane 6) confirmed their knockout. 
We analysed mitophagy in pentaKOs by measuring the degradation 
of cytochrome C oxidase subunit II (COXII), a mitochondrial DNA 
(mtDNA)-encoded inner membrane protein, after mitochondrial 
damage with oligomycin and antimycin A. After oligomycin and 
antimycin A treatment, COXII was degraded in wild-type cells 
expressing parkin, but not in pentakKOs or ATGS knockout HeLa 
cells, indicating a block in mitophagy (Fig. 1b, c, Supplementary 
Table 1 and Extended Data Fig. 1a). As a second indicator of mito- 
phagy, mtDNA nucleoids were quantified by immunofluorescence 
(Extended Data Fig. 1b). After 24h of oligomycin/antimycin A treat- 
ment, wild-type cells were nearly devoid of mtDNA, whereas mtDNA 


was retained in pentaKOs and ATGS5 knockouts (Fig. 1d, e). Parkin 
translocated to mitochondria (Extended Data Fig. 1c), and MFN1 and 
TOM20 (also known as TOMM20) were degraded via the proteasome 
comparably in wild-type and pentaKOs (Fig. 1b and Extended Data 
Fig. 1d). mtDNA nucleoids clump after oligomycin and antimycin A 
treatment in ATG5 knockout cells, but not in pentaKOs, consistent 
with a reported role of p62 (refs 10, 11). 

The five endogenous receptors in wild-type cells (Extended Data 
Fig. 1c) and each receptor re-expressed in pentaKOs (Extended Data 
Fig. le, f) translocated to mitochondria after oligomycin/antimycin A 
treatment. However, in pentaKOs, only green fluorescent protein 
(GFP)-tagged NDP52, GFP-OPTN and, to a lesser extent, GFP- 
TAXIBPI1 rescued mitophagy (Fig. 1f, g). Another recently reported 
autophagy receptor, TOLLIP”, neither recruited to mitochondria nor 
rescued mitophagy after oligomycin/antimycin A _ treatment 
(Extended Data Fig. 1g-i). 

We generated OPTN and NDP32 single knockout, NDP52/OPTN 
double knockout (DKO), and NDP52/OPTN/TAX1BP1 triple knock- 
out cell lines (Supplementary Table 1 and Fig. 1a), and found no 
compensatory change in the expression of the remaining receptors. 
NDP52 or OPTN knockout alone caused no defect in mitophagy, 
whereas the NDP52/OPTN DKO and to a greater extent the 
NDP52/OPTN/TAXIBP1 triple knockout inhibited mitophagy 
(Fig. 2a-d and Extended Data Fig. 2a, b). The robust mitophagy 
observed in OPTN knockouts contrasts with a report indicating loss 
of mitophagy using RNA interference (RNAi)-mediated knockdown 
of OPTN in HeLa cells’. Although NDP52 and OPTN redundantly 
mediate mitophagy, they function non-redundantly in xenophagy”’. 
Their expression levels in human tissues indicate that OPTN or 
NDP52 may function more prominently in different tissues 
(Extended Data Fig. 2c). 

Mutations in autophagy receptors can lead to diseases such as 
primary open angle glaucoma (OPTN; E50K)"*, amyotrophic lateral 


1Biochemistry Section, Surgical Neurology Branch, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, Maryland 20892, USA. +Present address: Department of 


Biochemistry and Molecular Biology, Monash University, Clayton, Melbourne 3800, Australia. 
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Figure 1 | Identifying autophagy receptors required for PINK1/parkin 
mitophagy. a, Wild-type (WT), OPTN knockout (KO), NDP52 knockout, 
NDP52/OPTN DKO (N/O DKO), NDP52/OPTN/TAXIBP1 triple knockout 
(N/O/Tx TKO), and pentaKO (NDP52/OPTN/TAX1BP1/NBR1/p62) HeLa 
cells were confirmed by immunoblotting (IB). kDa, kilodaltons. b, c, Cells as 
indicated with or without mCherry (mCh)-parkin were analysed by 
immunoblotting (b), and COXII levels were quantified (c). d, e, Representative 
images of mCh-parkin-expressing wild-type, pentaKO and ATGS5 knockout 
cells immunostained to label mtDNA (green) (d), and quantified for mitophagy 
(24 h oligomycin/antimycin A (OA)) (e). More than 75 cells were counted 
per sample. DAPI, 4’,6-diamidino-2-phenylindole. Scale bars, 10 jim. 

f, g, Lysates from pentaKO cells expressing mCh-parkin and GFP-tagged 
autophagy receptors were immunoblotted (f) and COXII levels were quantified 
(g). Data in c, e and g are mean + s.d. from three independent experiments. 
* P< 0.001 (one-way analysis of variance (ANOVA)). 


sclerosis (OPTN; E478G and Q398X, in which X denotes a stop 
codon)'* and Crohn’s disease (NDP52; V248A)'°. Defects in xeno- 
phagy occur when OPTN is mutated to block its phosphorylation by 
TANK-binding kinase 1 (TBK1; $177A) or its ubiquitin binding 
(D474N)"*"”. In pentaKOs, the UBAN domain-disrupting mutants 
OPTN (Q398X), OPTN(D474N) and OPTN(E478G) (Extended 
Data Fig. 2d) failed to translocate to mitochondria (Extended Data 
Fig. 2e, f) or rescue mitophagy (Fig. 2e and Extended Data Fig. 2g). 
OPTN(S177A) weakly rescued mitophagy and minimally translo- 
cated to mitochondria, whereas OPTN(E50K) robustly translocated 
and substantially rescued mitophagy (Fig. 2e and Extended Data 
Fig. 2e-g). NDP52(V248A) fully recruited to mitochondria and 
rescued mitophagy, but a mutant lacking the zinc-finger ubiquitin- 
binding domains (NDP52(AZF))"* did not (Extended Data Fig. 2h-k 
and Fig. 2f). Thus, ubiquitin binding by OPTN and NDP352 is neces- 
sary for mitophagy, and some disease-causing mutations prevent 
mitophagy. 
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TBK1 and OPTN cooperate in mitophagy 


TBK1 phosphorylation of OPTN at Serl77 increases its association 
with LC3 during xenophagy’, and the OPTN(E50K) mutation 
increases TBK1 and OPTN binding’. TBK1 auto-phosphorylation 
at Ser172 is indicative of TBK1 activation” and occurs in a parkin- 
dependent manner after 3h of oligomycin/antimycin A treatment, 
but only in cells expressing OPTN (Extended Data Fig. 3b, lanes 4 and 
10). Prolonged oligomycin and antimycin A treatment induces mod- 
erate TBK1 phosphorylation in the absence of parkin, but still requires 
PINK] (Extended Data Fig. 3c). To investigate TBK1 function during 
mitophagy, we generated TBK1 knockout, and TBK1/NDP52 and 
TBKI/OPIN DKO HeLa cells (Extended Data Fig. 3d and 
Supplementary Table 1). Parkin translocated to mitochondria in all 
cell lines, however, only TBK1/NDP52 DKOs displayed defective 
mitophagy (Extended Data Fig. 3e and Fig. 2g-j). Mitophagy in 
TBK1/NDP52 DKOs was rescued by wild-type TBK1 or phospho- 
mimetic OPTN (OPTN(S177D)), but not by kinase-dead TBK1 
(TBK1(K38M); Extended Data Fig. 3g-i). Thus, in the absence of 
NDP52, TBK1 is crucial for effective mitophagy via OPTN. 


Ubiquitin phosphorylation in mitophagy 

Although many autophagy receptors recruit to the mitochondria after 
parkin activation, it is unclear why only some of these function in 
mitophagy. Parkin-mediated mitophagy is driven by the PINK1 phos- 
phorylation of Ser65 of both ubiquitin®’?'” and the parkin UBL 
domain’. Because Ser65 phospho-ubiquitin is structurally unique, 
it may differentially interact with ubiquitin-binding proteins”. To 
determine whether OPTN is directly recruited to phospho-ubiquitin 
on mitochondria, we conditionally expressed PINK1 on undamaged 
mitochondria” in HeLa cells lacking parkin (Fig. 3a and Extended 
Data Fig. 4a). When PINK1A110-YFP-2XFKBP (in which PINK1 
lacks its N-terminal targeting sequence and is fused with yellow fluor- 
escent protein (YFP) and two tandem FKBP domains at the C ter- 
minus) is cytosolic, mCherry-tagged OPTN (mCherry-OPTN), 
mCherry-NDP52 and mCherry-p62 are also cytosolic (Extended 
Data Fig. 4b, c). After rapalog treatment, PINK1A110-YFP- 
2XFKBP localizes to mitochondria expressing the FKBP12/rapamy- 
cin-binding (FRB)-Fisl targeting construct where ubiquitin on 
mitochondrial surface proteins* (Extended Data Fig. 4a) can then 
be phosphorylated*”'*°”° . This recruits OPTN and NDP52 to mito- 
chondria (Fig. 3a, b), but not p62 (Fig. 3b and Extended Data Fig. 4c). 
OPTN and NDP52 recruitment requires PINK1 kinase activity 
(Fig. 3a, b) and receptor-ubiquitin binding, as the OPTN(D474N) 
and NDP52(AZF) mutants fail to recruit after rapalog treatment 
(Fig. 3b and Extended Data Fig. 4d, e). Therefore, the ubiquitin kinase 
activity of PINK1 recruits OPTN and NDP52 to mitochondria via 
ubiquitin-binding domains in the absence of parkin. 

To determine whether the observed autophagy receptor recruit- 
ment to mitochondria in the absence of parkin can induce mitophagy, 
we developed a sensitive fluorescence-activated cell sorting (FACS)- 
based mitophagy assay. We expressed mitochondrial-targeted 
mKeima (mt-mKeima, see Methods) in wild-type and pentaKO cells 
also expressing mitochondrial FRB-Fis] and PINK1A110-YFP- 
2 FKBP. mt-mKeima engulfment into lysosomes results in a spectral 
shift owing to low pH. Only 1% (range 0.89-1.15) of wild-type or 
pentaKO cells display mitophagy when PINK] is cytosolic. However, 
when PINK] is recruited to mitochondria with rapalog, mitophagy 
increases ~sevenfold in wild-type cells and ~eightfold with overex- 
pressed OPTN (Table 1 and Extended Data Fig. 5a). PentaKO cells 
showed no increase in mitophagy after targeting PINK1 to mitochon- 
dria (Table 1 and Extended Data Fig. 5b). When rescued with Flag/ 
haemagglutinin (HA)-tagged OPTN or NDP52, pentaKOs displayed 
an increase in mitophagy of more than fivefold or fourfold, respect- 
ively (Table 1 and Extended Data Fig. 5b-e). Rescue with Flag/HA- 
p62 or ubiquitin-binding mutants (OPTN(Q398X), OPTN(D474N) 
and NDP52 (AZF)) failed to increase mitophagy above baseline, but 
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Figure 2 | OPTN and NDP52 are redundant in PINK1/parkin mitophagy. 
a, b, Representative images of wild-type, pentaKO, OPTN knockout, 

NDP52 knockout and NDP52/OPTN DKO cells expressing mCh-parkin 
immunostained with anti-DNA (green) (a), and quantified for mitophagy (b). 
c, d, Cell lines from a were analysed by immunoblotting (c), and COXII 
levels were quantified (d). e, COXII levels quantified from pentaKO cells 
expressing mCh-parkin and rescued with wild-type or mutant GFP-OPTN 
(see Extended Data Fig. 2g for blots). f, COXII levels quantified from pentaKOs 
expressing mCh-parkin rescued with wild-type or mutant GFP-NDP52 


mCh-OPTN 


a xe AS 


(see Extended Data Fig. 2k for blots). g, h, Representative images of wild-type, 
TBK1 knockout, TBK1/OPTN (T/O DKO) and TBK1/NDP52 (T/N DKO) 
HeLa cells expressing mCh-parkin and immunostained with anti-DNA 

(g), and quantification of mitophagy (h). i, j, Cells from g were immunoblotted 
(i) and COXII levels were quantified (j). Data in b, d-f, h andi are mean = s.d. 
from three independent experiments. **P < 0.005, ***P < 0.001 (one-way 
ANOVA). NS, not significant. More than 75 cells were measured per confocal 
sample. Scale bars, 10 1m. For untreated and mCh-parkin images of a and g, 
see Extended Data Fig. 3a and f, respectively. 
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Figure 3 | PINK1 recruits OPTN and NDP52 independently of parkin to 
promote mitophagy. a, Representative images of HeLa cells expressing FRB- 
Fis1, wild-type or kinase-dead (KD) PINK1, PINK1A110-YFP-2FKBP 
(PINK1-YFP), mCh-OPTN or mCh-NDP3z2, treated with rapalog. Scale bars, 
10 um. b, Quantification of receptor translocation in cells from a and Extended 
Data Fig. 4c-e. More than 100 cells were counted per sample. c, Cells were 


mKeima pH 7 (488 nm) 


treated with rapalog and analysed by FACS for lysosomal-positive mt-mKeima. 
Representative data for wild-type or kinase-dead PINK1A110-YFP-2 FKBP 
in pentaKO cells with or without Flag/HA-OPTN expression. Data in b are 
mean ~ s.d. from three independent experiments. *P < 0.05, ***P < 0.001 
(one-way ANOVA). For images of untreated cells from a, see Extended Data 
Fig. 4b. 
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Table 1 | Rapalog-induced mitophagy 


Cell type PINK1-YFP-2xFKBP Receptor rescue Receptor mutation Parkin Rap Mean (%) Fold change 
WT PINK1 WT Vector = = Lis al 
+ 8.37 7.28 
OPTN WT _ - 1.08 _ 
+ 8.59 795 
PentaKO PINK1 WT Vector - aa 0.93 - 
+ 0.99 1.06 
OPTN WT - a 0.89 a 
+ 4.83 5.43 
PentaKO PINK1 WT Vector - + 1.13 7 
OPTN T + 5.27 4.66 
PINK1 KD Vector = + 0.88 0.78 
OPTN WT + 0.76 0.67 
PentaKO PINK1 WT Vector a + 0.93 - 
OPTN WT ee 2.97 3.19 
OPTN E50K iP 15.80 17.0 
OPTN S177A + 257 2.76 
OPTN Q398X at 0.87 0.78 
OPTN D474N + 0.76 0.67 
PINK1 WT Vector = + 0.87 = 
NDP52 WT + 3.50 3.98 
NDP52 V248A ot 2.80 3.18 
NDP52 AZF 4 0.47 0.53 
p62 WT 4 1.23 1.36 
PINK1 WT OPTN WT = - 1.10 = 
OPTN WT + 5.30 4.81 
OPTN WT + - 1.43 7 
OPTN WT + 23.60 16.50 


other mutants (OPTN(E50K), OPTN(S177A) and NDP52(V248<A)) 
rescued mitophagy (Table 1 and Extended Data Fig. 5c-f). 
OPTN(E50K) and OPTN(S177A) restored mitophagy as well as or 
better than wild-type OPTN (Table 1), differing from their response 
in the presence of parkin (Fig. 2e), probably owing to the lack of robust 
TBK1 activation when parkin is absent (Extended Data Fig. 3b). Here, 
enhanced OPTN(E50K) binding to TBK1 (ref. 19) may become 
advantageous by allowing OPTN phosphorylation by TBK1 in the 
absence of parkin, thus improving mitophagy. In the absence of 
TBK1 activation, OPTN is probably not phosphorylated at Ser177 
and thus is functionally similar to OPTN(S177A). Notably, the ubi- 
quitin kinase activity of PINK] is required, as kinase-dead PINK1 did 
not induce mitophagy (Table 1, Fig. 3c and Extended Data Fig. 5g). 
Parkin expression markedly increased mitophagy in Flag/HA- 
OPTN-expressing pentaKO cells (Table 1 and Extended Data Fig. 
5h), supporting the model that PINK1-phosphorylated ubiquitin 
recruits receptors for mitophagy, and parkin ubiquitination of mito- 
chondrial substrates amplifies this signal. 

Comparing mitophagy induced by oligomycin/antimycin A treat- 
ment in wild-type relative to PINK] knockout cells confirmed that 
endogenous PINK1 mediates mitophagy in the absence of parkin 
(Extended Data Fig. 6a, b). Re-expressing PINK] in PINK1-knockout 
cells rescued oligomycin/antimycin-A-induced mitophagy (Extended 
Data Fig. 6c, d). Furthermore, mCherry-OPTN is recruited to mito- 
chondria in the absence of parkin in a PINK1-dependent manner 
after prolonged exposure to oligomycin/antimycin A (Extended 
Data Fig. 6e, f). 

Given that PINK1 ubiquitin kinase activity can recruit OPTN 
and NDP52, we investigated autophagy receptor binding to phospho- 
mimetic (S65D) HA-ubiquitin in HeLa cells. Endogenous 
OPTN and NDP52 preferentially co-immunoprecipitate with 
HA-ubiquitin(S65D) (Extended Data Fig. 7a). Conversely, p62 
was present at equal levels in all co-immunoprecipitates (Extended 
Data Fig. 7a). Ubiquitin-modified and unmodified forms of 
OPTN and NDP52 were present in co-immunoprecipitates, and 
HA-ubiquitin(S65D) induced or preserved this modification 
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(Extended Data Fig. 7a). Co-immunoprecipitated samples treated 
with the deubiquitinase USP2 removed the ubiquitin-modified bands 
on OPTN and NDP52, yet OPTN and NDP52 retained 
HA-ubiquitin(S65D) binding (Extended Data Fig. 7b). The binding 
of endogenous receptors in HeLa cell cytosol to in vitro phosphory- 
lated strep-tactin-tagged ubiquitin (Extended Data Fig. 7c) showed 
that OPTN, but not p62, bound better to phospho-ubiquitin 
(Extended Data Fig. 7d, e). However, recombinant glutathione 
S-transferase (GST)-OPTN did not bind better to in vitro phosphory- 
lated Lys63-linked ubiquitin chains’’, indicating that OPTN may 
need additional factors or modification in vivo to bind Ser65-phos- 
phorylated ubiquitin preferentially. 


OPTN and NDP52 recruit upstream machinery 


Autophagy receptors are thought to function primarily by bridging 
LC3 and ubiquitinated cargo’’. In mCherry-parkin wild-type cells, 
GFP-LC3B accumulated in distinct puncta adjacent to mitochondria 
after oligomycin/antimycin A treatment (Extended Data Fig. 8a). 
Although oligomycin and antimycin A also induced GFP-LC3B 
puncta in pentaKOs, they were fewer and not near mitochondria 
(Extended Data Fig. 8a). Conversely, GFP-LC3B in ATG5 knockouts 
was near mitochondria, but not in puncta (Extended Data Fig. 8a). 
LC3B lipidation is retained in pentaKOs, but lost in ATG5 knockouts 
(Extended Data Fig. 8b). This indicates that ATGS is activated down- 
stream of PINK1, but independently of autophagy receptors, and that 
LC3 mitochondrial localization and lipidation are independent steps 
of mitophagy. 

OPTN and NDP32 interact with LC3B and LC3C, respectively, for 
Salmonella clearance’*”*. Beyond that, little is known about the 
specificity of LC3 family members towards autophagy receptors” or 
their involvement in mitophagy. We examined the recruitment of all 
LC3 and GABARAP family members to mitochondria in wild-type, 
pentaKO and NDP52/OPTN DKO cells. The oligomycin/antimycin- 
A-induced mitochondrial localization of GFP-LC3s in wild-type 
cells was absent in pentaKO cells, while only GFP-LC3B recruitment 
was inhibited in NDP52/OPTN DKOs (Fig. 4a and Extended 
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Figure 4 | Characterization of autophagy receptor function during 
mitophagy. a-d, mCh-parkin-expressing wild-type, NDP52/OPTN DKO and 
pentaKO cells were quantified for GFP-LC3A, GFP-LC3B and GFP-LC3C 
translocation to mitochondria (a), and for GFP—WIPI1 (b) or GFP-DFCP1 
(c) structures per cell (>100 cells counted for each sample), or were 
immunoblotted using phospho-specific anti-S757 and anti-S317 ULK1 
antibodies (d). e, mCh-parkin wild-type, NDP52/OPTN DKO and pentaKO 
cells stably expressing GFP-ULK1 were quantified for GFP-ULK1 puncta per 
cell (left), and the percentage of those puncta on mitochondria was determined 
(right). f, Representative cells from e were immunostained for TOM20 and 
GFP. g, PentaKO cells expressing FRB-Fis1, PINK1A110-YFP-2 FKBP, 
mCh-ULK]1 and Myc-tagged receptors were treated with rapalog and then 
imaged live. h, Quantification of mitochondrial ULK1 puncta in 

g. i, Quantification of mitochondrial ULK1 puncta in pentaKO cells expressing 
FRB-Fis1, PINK1A110-YFP-2FKBP, mCh-ULK1 and Myc-OPTN 
mutants, treated with rapalog then imaged live. Data in a-c, e, h and i are 
mean + s.d. from three independent experiments. **P < 0.005, ***P < 0.001 
(one-way ANOVA). For live-cell quantification, >75 cells were counted in a 
blinded manner. Quantifications in h and i were performed after removal of 
outliers; see Methods for details. Scale bars, 10 tm. a.u., arbitrary units. See 
Extended Data Figs 8c, 9b, c and 10d for representative images of a, b, c and 
i, respectively. See Extended Data Fig. 9e for untreated samples of f, and 
Extended Data Fig. 10c for untreated images of h and i. 


Data Fig. 8c). GFP-LC3C recruitment was inhibited in NDP52/ 
OPTN/TAXI1BP1 triple knockouts (Extended Data Fig. 8d, e), indi- 
cating that TAXIBP1 can recruit LC3C during mitophagy. 
GABARAP proteins did not recruit to mitochondria, indicating that 
they probably do not have a substantial role in mitophagy (Extended 
Data Fig. 9a). 

We also examined the involvement of WIPI1 and DFCP1, two pro- 
teins that mediate phagophore biogenesis upstream of LC3 (ref. 30), in 
mitophagy. In wild-type cells, oligomycin/antimycinA-induced foci of 
both GFP-WIPI1 and GFP-DFCP1, mostly localized on or near mito- 
chondria (Fig. 4b, c and Extended Data Fig. 9b, c). In NDP52/OPTN 
DKO cells, GFP—WIPI1 and GFP-DFCP1 foci were reduced, and 
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almost undetectable in pentaKO cells (Fig. 4b, c and Extended Data 
Fig. 9b, c). Despite this, the phosphorylation of beclin1 (ref. 31) was 
normal in both pentakKO and NDP52/OPTN DKO cells (Extended 
Data Fig. 9d), indicating that failure to recruit WIPI] and DFPC1 
was not due to a defective VPS34 (also known as PIK3C3) complex. 
GFP-DFCP1 recruitment in pentaKO cells was rescued by the express- 
ion of Flag/HA-OPTN or Flag/HA-NDP352, but not by Flag/HA-p62 
(Extended Data Fig. 10a). 

Although autophagy receptors are thought to function late in 
autophagy with LC3 (ref. 32), the deficit in WIPI1 and DFCP1 recruit- 
ment to mitochondria indicates a defect upstream in autophagosome 
biogenesis. ULK1 phosphorylation by AMPK at Ser317 and 
dephosphorylation at Ser757 (ref. 33), required for activation, occurs 
comparably in wild-type, NDP52/OPTN DKO and pentaKO cells 
(Fig. 4d). Despite this, ULK1 puncta appearance™ after oligomycin 
and antimycin A treatment is diminished by half in the NDP52/ 
OPTN DKO cells, and by more than 80% in pentaKO cells (Fig. 4e, 
f). Flag/HA-OPTN or Flag/HA-NDP352, but not Flag/HA-p62, res- 
cued GFP-ULK1 localization in pentaKO cells (Extended Data Fig. 
10b). Overall, these data indicate that NDP52 and OPTN recruit 
ULK1 to initiate mitophagy. 

We next assessed whether ubiquitin phosphorylation, independent 
of parkin, is also sufficient to recruit ULK1 to mitochondria. Rescue of 
pentaKO cells expressing FRB-Fis] and PINK1A110-YFP-2FKBP 
with Myc-OPTN or Myc-NDP52 resulted in mitochondrial ULK1 
puncta after rapalog treatment (Fig. 4g, h). Myc-OPTN(E50K) also 
rescued ULK1 recruitment to mitochondria, but mutant Myc- 
OPTN(Q398X), which is associated with amyotrophic lateral scler- 
osis, did not (Fig. 4i and Extended Data Fig. 10d). ULK1 recruitment 
was restored by Myc-OPTN(F178A) (Fig. 4i and Extended Data Fig. 
10d), a mutation that disrupts OPTN association with LC3 (ref. 13), 
indicating that ULK1] recruitment is not through LC3 interaction and 
occurs upstream of LC3 (ref. 34). Taken together, our data show that 
PINK] ubiquitin kinase activity is sufficient to recruit the autophagy 
receptors and upstream autophagy machinery to mitochondria to 
induce mitophagy. 


Conclusions 


Through the genetic knockout of five autophagy receptors, we have 
defined their relative roles in mitophagy and identified their unanti- 
cipated upstream involvement in autophagy recruitment machinery. 
p62 and NBRI are dispensable for parkin-mediated mitophagy; 
OPTN and NDP352 are the primary, yet redundant, receptors. We 
also uncovered a new and more fundamental role for PINK1 in mito- 
phagy: to induce mitophagy directly through phospho-ubiquitin- 
mediated recruitment of autophagy receptors. We propose that 
PINK] generates the novel and essential signature (phospho-ubiqui- 
tin) on mitochondria to induce OPTN and NDP352 recruitment and 
mitophagy; parkin acts to increase this signal by generating more 
ubiquitin chains on mitochondria, which are subsequently phos- 
phorylated by PINK1. Our findings clarify the role of parkin as an 
amplifier of the PINK1-generated mitophagy signal, phospho-ubiqui- 
tin, which can engage the autophagy receptors to recruit ULK1, 
DFCP1, WIPI1 and LC3 (see model in Extended Data Fig. 10e). 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Cell culture, antibodies and reagents. HEK293T, HeLa and PINK1 knockout 
cells** were cultured in DMEM (Life Technologies) supplemented with 10% (w/v) 
FBS (Gemini Bio Products), 10 mM HEPES (Life Technologies), 1 mM sodium 
pyruvate (Life Technologies), non-essential amino acids (Life Technologies) and 
GlutaMAX (Life Technologies). HeLa cells were acquired from the ATCC and 
authenticated by the Johns Hopkins GRCF Fragment Analysis Facility using STR 
profiling. All cells were tested for mycoplasma contamination bimonthly using 
the PlasmoTest kit (InvivoGen). Transfection reagents used were: effectene 
(Qiagen), lipofectamine LTX (Life Technologies), avalanche-OMNI (EZ Bio- 
systems), X-tremeGENE HP (Roche) and X-tremeGENE 9 (Roche). 

The following rabbit monoclonal and polyclonal antibodies were used: beclin, 
pULK1(S317), pULK1(S757), TBK1, pTBK1(S172), NDP52, TAX1BP1, ATGS, 
actin, and haemagglutinin (Cell Signaling Technologies); GAPDH and LC3B 
(Sigma); ULK1 and TOM20 (Santa Cruz Biotechnology); OPTN (Proteintech); 
GFP (Life Technologies); pSer65 ubiquitin (Millipore) and MFN1 was generated 
previously**. Mouse monoclonal antibodies used were: NBR1 and p62 (Abnova), 
COXII (Abcam), parkin (Santa Cruz Biotechnology), DNA (Progen Biotechnik) 
and ubiquitin (Cell Signaling). Chicken anti-GFP (Life Technologies) was also 
used. For catalogue numbers, see Supplementary Table 1. Human tissue panel 
blots were purchased (NOVUS Biologicals). 

Generation of knockout lines using TALEN and CRISPR/Cas9 gene editing. 
To generate knockout cell lines, transcription activator-like effector nuclease 
(TALEN) and CRISPR guide RNAs (gRNAs) were chosen that targeted an exon 
common to all splicing variants of the gene of interest (listed in Supplementary 
Table 1). TALEN was used to generate the OPTN knockout HeLa cell line. The 
TALEN constructs were generated by sequential ligation of coding repeats into 
pcDNA3.1/Zeo-Talen(+ 63), as previously described”. The CRISPR/Cas9 sys- 
tem generated previously*® was used to knockout ATG5, NDP52, TAX1BP1, 
NBRI, p62 and TBK1. Oligonucleotides (Operon) containing CRISPR target 
sequences were annealed and ligated into Alfll-linearized gRNA vector 
(Addgene 41824)". For CRISPR/Cas9 gene editing, HeLa cells were transfected 
with gRNA constructs, hCAS9 (Addgene 41815) and pEGFP-C1 (Clontech). For 
TALEN gene editing, HeLa cells were transfected with OPTN TALEN constructs 
and pEGFP-Cl1. Two days after transfection, GFP-positive cells were sorted by 
FACS and plated in 96-well plates. Single colonies were expanded into 24-well 
plates before screening for depletion of the targeted gene product by immuno- 
blotting. As a secondary screen of some knockout lines, genomic DNA was 
isolated from cells and the genomic regions of interest were amplified using 
PCR followed restriction enzyme digestion analysis (primers listed in 
Supplementary Table 1). Sequencing of targeted genomic regions of knockout 
lines was also conducted to confirm the presence of frameshifting indels in the 
genes of interest (Supplementary Table 1). To generate multiple gene knockout 
cell lines, parental cell lines were transfected sequentially with one or multiple 
gRNA constructs to generate desired knockout lines. Parental cell lines are out- 
lined in Supplementary Table 1. 

Cloning and generation of stable cell lines. pMXs-puro-GFP-WIPI1 and 
pMXs-puro-GFP-DFCP1 were a gift from N. Mizushima, and pMXs-IP-GFP- 
ULK1 was purchased from Addgene (38193). To generate pBMN-mEGFP-C1, 
mEGFP-C1 (Addgene 36412) was PCR amplified (together with the multiple 
cloning site) and cloned into pBMN-Z at BamHI/Sall sites using the Gibson 
Cloning kit (New England BioLabs) according to the manufacturer’s instructions. 
The BamHI and Sall sites used to insert mEGFP-C1 were not regenerated. The 
following GFP-tagged plasmids were generated by PCR amplification of open 
reading frames followed by ligation into pBMN-mEGFP-Cl: OPTN, NDP52, 
p62, TAX1BP1, NBR1, LC3A, LC3B, LC3C, GABARAP, GABARAPLI and 
GABARAPL2. The Gateway Cloning (Invitrogen) system was used to generate 
GFP-, mCherry-, Myc- and Flag/HA-tagged constructs. In brief, TBK1, 
TBK1(K38M), NDP52, OPTN, p62, DFCP1, WIPI1 and ULK1 were cloned into 
pDONR2333. Mutations in complementary DNA sequences were introduced 
using PCR site-directed mutagenesis in the pDONR2333 vector, (sequences of 
mutagenesis primers used are available on request) then recombined into 
pHAGE-N-Flag/HA, pHAGE-N-GFP, pHAGE-N-mCherry and/or pDEST-N- 
myc using LR Clonase (Invitrogen) as per the manufacturer’s protocol. All con- 
structs generated in this study were verified by sequencing. 

To generate stably transfected cell lines, retroviruses (for pBMN-mEGFP-C1 
constructs, pBMN-mCherry-parkin, pBMN-puro-P2A-FRB-Fis1, pCHAC-mt- 
mKeima-IRES-MCS2) and lentiviruses (for pHAGE and pDEST constructs) were 
packaged in HEK293T cells. HeLa cells were transduced with virus for 24h with 
8 tg ml’ polybrene (Sigma) then optimized for protein expression via selection 
(puromycin or blasticidin) or fluorescence sorting. 

Translocation and mitophagy treatments. Cells were either left untreated or 
treated with 10 1M oligomycin (Calbiochem), 4 1M antimycin A (Sigma) in fresh 
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growth medium for different periods of time as indicated. Some experiments were 
performed with 10 1M carbonyl cyanide m-chlorophenyl hydrazine (CCCP) as 
indicated (Sigma-Aldrich). We chose to use oligomycin/antimycin A to depol- 
arize mitochondria in most of our experiments, as they are specific mitochondrial 
respiratory complex inhibitors and less toxic. Long treatment time points of both 
oligomycin/antimycin A and CCCP were also supplemented with the apoptosis 
inhibitor 20 4M QVD (ApexBio) to prevent cell death. 

Immunoblotting and phos-tag gels. HeLa cells seeded into six-well plates were 
either untreated or treated with 10 uM oligomycin (Calbiochem), 4 1M antimy- 
cin A (Sigma) and 20 1M QVD (ApexBio) in fresh growth medium for different 
periods of time as indicated. Cells were lysed in 1X LDS sample buffer 
(Life Technologies) supplemented with 100mM dithiothreitol (DTT; Sigma) 
and heated to 99°C with shaking for 7-10 min. Approximately 25-50 1g of 
protein per sample was separated on 4-12% Bis-Tris gels (Life Technologies) 
according to manufacturer’s instructions, and then transferred to polyvinyl 
difluoride membranes and immunoblotted using antibodies as indicated. To 
assess mitophagy, COXII quantification was conducted using ImageLab software 
(BioRad). For uncropped images of all immunoblots, see Supplementary 
Information. 

To dephosphorylate samples, cells were collected as above and lysed in 1X NEB 
buffer 3 (New England BioLabs) supplemented with 1% Triton X-100 and passed 
through a 26.5-gauge needle. Calf intestinal phosphatase (CIP; New England 
BioLabs) was added to half of the cell lysate and the other half was used as an 
untreated control. Both samples were incubated for 1 h at 37 °C and analysed by 
SDS-PAGE and immunoblotting. 

To analyse beclin phosphorylation, lysates were prepared in sample buffer 
lacking EDTA and run on 8% Tris-glycine gels containing 20 1M phos-tag 
(Wako) and 40 1M MnCl, as described previously*'. Gels lacking phos-tag were 
run simultaneously as a negative control. Electrophoresis and western transfer 
were carried out using standard protocols with the exception that phos-tag gels 
were incubated in 10mM EDTA for 10min to remove excess Mn** before 
transfer. 

Immunoprecipitation. Wild-type or PINK1 knockout HeLa cells were transi- 
ently transfected with HA-tagged ubiquitin, ubiquitin(S65A) or ubiquitin(S65D) 
with or without mCherry-parkin for 24 h. Cells were collected, lysed and the HA- 
ubiquitin was immunoprecipitated as reported previously’, using anti-HA-con- 
jugated beads (Pierce). To deubiquitinate the bound proteins, after binding the 
HA-ubiquitin, beads were washed three times and incubated in 50 mM Tris-HCl, 
pH7.5, 150 mM NaCl, 5mM DTT and 1.47 pg USP2 (Boston Biochem) at 37 °C 
for 1 h. The reaction was stopped and the remaining bound proteins were washed 
five times with 1 ml of buffer (50 mM Tris-HCl, pH 7.5, 150 mM NaCl), and then 
eluted by boiling with 1x LDS sample buffer. 

In vitro phosphorylation. Strep-tagged ubiquitin was incubated with either 
wild-type or kinase-dead Tribolium castaneum PINK1 (TcPINK1) as previously 
reported’. This ubiquitin was then incubated with cytosol from wild-type HeLa 
cells in 20 mM HEPES-KOH, pH 7.6, 220 mM mannitol and 70 mM sucrose at 
4°C for 1h. Strep-tactin beads (Qiagen) were then added to bind the strep- 
ubiquitin for an additional 1h at 4°C. The ubiquitin and bound proteins were 
then eluted with 50 mM biotin in 50 mM Tris for 15 min at room temperature. 
Samples were then diluted in LDS sample buffer before SDS-PAGE and immu- 
noblot analysis. 

Immunofluorescence microscopy. HeLa cells, seeded in two-well chamber slides 
(Lab-Tek), were treated as indicated in the figure legends. After treatment, cells 
were rinsed in PBS and fixed for 15 min at room temperature with 4% para- 
formaldehyde. Cells were then permeabilized and blocked with 0.1% Triton 
X-100, 3% goat serum in PBS for 40 min. For immunostaining, cells were incu- 
bated with antibodies (as indicated in figure legends) diluted in 3% goat blocking 
serum overnight at 4°C, then rinsed with PBS and incubated with either anti- 
rabbit or mouse Alexa-Fluor-488- and Alexa-Fluor-633-conjugated secondary 
antibodies (Life Technologies), or anti-chicken Alexa-Fluor-488-conjugated 
antibody (Life Technologies) for 1h at room temperature. Cells were washed 
three times for 5 min each with 1% Triton X-100 in PBS. During the final wash 
step, cells were incubated with 10 pgml + DAPI (Sigma) in PBS for 5 min. To 
measure mitophagy by mtDNA immunostaining, images were obtained from 
samples stained with DAPI and immunostained for DNA using a plan- 
Apochromat 63X/1.4 oil DIC objective on an LSM 510 microscope (Zeiss). 
Four image slices were collected through the Z plane encompassing the top 
and bottom of the cells. Image analysis was performed on all images collected 
in the Z plane using Volocity software (Perkin Elmer v6.0.1). The percentage of 
mtDNA stain remaining was calculated using the following formula: 
(cDNA, — nDNA,)/n, in which cDNA, is the total cellular DNA volume deter- 
mined by staining using anti-DNA antibodies, and nDNA, is the total nuclear 
DNA stain volume determined using DAPI, and n denotes the number of cells. 
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The mtDNA stain volume in untreated cells was normalized to 100% and the 
amount of mtDNA stain remaining after drug treatment was subsequently deter- 
mined. Final values represent data acquired from 50-200 cells from three inde- 
pendent experiments, and investigators were not blinded to allocation during 
image analysis. 

To analyse LC3/mitochondrial protein colocalization; cells were treated, fixed 
and immunostained as above. Between 5 and 8 slices were imaged through the Z 
plane using either a plan-Apochromat 63 or 100X/1.4 oil DIC objective on a 
CW STED confocal microscope (Leica). Volocity software (Perkin Elmer, v6.0.1) 
was used to measure the intensity of the GFP signal representing LC3 in the 
volume occupied by mitochondria (as defined by the TOM20-positive region) 
and the cytosol (as defined by TOM20-negative region). ‘Normalized mitochon- 
drial LC3’ was calculated using the following formula: normalized mitochondrial 
LC3 = (m;/my)/(ci/cy), in which m; denotes the mitochondrial GFP intensity, m, 
denotes the mitochondrial volume, c; denotes the cytosolic GFP intensity, and 
cy denotes the cytosolic volume. The resulting normalized mitochondrial LC3 is 
equal to 1 if the intensity of GFP is equal per volume in the cytosolic and mito- 
chondrial volumes (no translocation) and is above one if the mitochondrial 
intensity is higher per volume (translocation). Final values for normalized mito- 
chondrial LC3 represents data acquired from 50 to 105 cells from three inde- 
pendent experiments. 

For GFP-DFCP1, GFP-WIPI1 and GFP-ULK1 puncta analysis, cells were 
treated, prepared and imaged on the CW STED as above with the addition of 
immunostaining using either rabbit or chicken GFP antibodies to enhance the 
signal in the green channel. GFP-DFCP1 puncta were quantified using Volocity 
software (Perkin Elmer v6.0.1) and GFP-WIPI1 and GFP-ULK1 puncta were 
quantified manually. Investigators were not blinded to allocation during image 
analysis. Colocalization of autophagy receptors with GFP-DFCP1 or GFP-ULK1 
was assessed with line scans using LAS AF software (Leica, v.2.6.0.7266). 
Heterodimerization. The C-terminal Fis1 tail of human Fisl (amino acids 
92-152) was cloned into pC4-RhE vector (ARIAD) at SpeI/BamHI sites to make 
FRB-Fis1 construct, the insert of which was then PCR amplified and cloned into 
pBMN-Z vector together with Puro-P2A sequence at HindIII, Xhol and Not! sites 
by an In-Fusion kit from Clontech to make pBMN-puro-P2A-FRB-Fis1. For 
receptor translocation assays, wild-type HeLa cells stably expressing FRB-Fis1 
were generated using retroviral transduction as described above. Previously 
generated PINK1A110-YFP-2XFKBP (ref. 41) wild-type and kinase-dead and 
individually each mCherry-tagged autophagy receptor were transfected into 
FRB-Fis1 stable HeLa cells for 24h. Cells were then treated with 0.5 1M rapalog 
(Clontech) for 8h as previously described*'. Cells were fixed and stained as 
described above. Cells were manually counted in a blinded manner for transloca- 
tion of mCherry-tagged autophagy receptors to mitochondria. Final values rep- 
resent data collected from 100-150 cells for three independent experiments. 

For ULK1 and DFCP1 rescue analysis, pentaKO HeLa cells stably expressing 
FRB-Fisl were transiently transfected with PINK1A110-YFP-2XFKBP, 
mCherry-ULK1 and one of the autophagy receptors: Myc-OPTN, Myc- 
NDP52, Myc-OPTN(F178A), Myc-OPTN(E50K) or Myc-OPTN(Q398X) for 
18-24 h. Cells were treated with 0.5 .M rapalog or vehicle for 7 h and imaged live 
ona Zeiss 780 in a humidified 37 °C, 5% CO, chamber. To visualize mitochondria 
in vehicle-treated controls, cells were pre-incubated for 10min in 75nM 


Mitotracker Deep Red (Invitrogen) before imaging. Fields of PINK1-YFP-pos- 
itive cells were imaged blindly to the mCherry-containing channel. The images 
were then blinded and counted manually for translocation of mCherry-tagged 
ULK1 to mitochondria. Final values represent >75 cells counted over at least two 
independent experiments. 

Mito-Keima mitophagy assay. mt-mKeima” (a gift from A. Miyawaki) was 
cloned into a pCHAC-MCS-1-IRES-MCS2 vector (Allele Biotechnology). 
PINK1A110-YFP-2XFKBP wild-type and kinase-dead were PCR-amplified 
from the original pC4M-F2E and cloned into pRetroQ-AcGFP-C1 at Nhel/ 
Xhol sites by Gibson assembly kit. The HA-tag was removed, and stop codon 
introduced. Wild-type and pentaKO HeLa cells stably expressing FRB-Fis1, mt- 
mKeima and either wild-type or kinase-dead PINK1A110-YFP-2FKBP were 
generated using retroviral transduction as described above. Flag/HA-receptors 
were stably expressed in these cells by lentivirus transduction as described above 
then treated with 0.5 1M rapalog for 24 h. Cells were then resuspended in sorting 
buffer (145 mM NaCl, 5 mM KCl, 1.8 mM CaCl,, 0.8 mM MgCl, 10 mM HEPES, 
10 mM glucose, 0.1% BSA) containing 10 jig ml” ' DAPI. Analysis was performed 
using Summit software (v6.2.6.16198) on a Beckman Coulter MoFlo Astrios cell 
sorter. Measurements of lysosomal mt-mKeima were made using dual-excitation 
ratiometric pH measurements at 488 (pH7) and 561 (pH4) nm lasers with 
620/29nm and 614/20nm emission filters, respectively. For each sample, 
50,000 events were collected and subsequently gated for YFP/mt-mKeima 
double-positive cells that were DAPI-negative. Data were analysed using 
FlowJo (v10, Tree Star). 

Statistical calculations. No statistical methods were used to predetermine sample 
size, and experiments were not randomized. All statistical data were calculated 
and graphed using GraphPad Prism 6. To assess statistical significance, data from 
three or more independent experiments were analysed using one-way ANOVA 
and Tukey’s post-test with a confidence interval of 95%. All error bars are 
expressed as mean + s.d. In Fig. 4h, i, outliers were removed using ROUT in 
GraphPad Prism 6 with Q=1%, and 1-2 values from each condition were 
removed. 
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Extended Data Figure 1 | Analysis of knockout cell lines and 
characterization of autophagy receptor translocation to damaged 
mitochondria. a, The ATGS knockout cell line was confirmed by 
immunoblotting. b, Representative images of mtDNA nucleoids in HeLa cells 
immunostained with an anti-DNA antibody (green), confirming colocalization 
with the mitochondrial marker TOM20 (red) (n = 3 experiments). 

c, Mitochondrial fractions from mCherry-parkin expressing pentaKO and 
wild-type cells were assessed by immunoblotting. d, mCh-parkin-expressing 
wild-type, pentakO and ATGS5 knockouts were treated with oligomycin/ 


antimycin A with or without MG132. Cell lysates were assessed by 
immunoblotting. e, Expression levels of GFP-tagged OPTN, NDP52, p62, 
NBRI and TAX1BP1 re-expressed in pentaKO cells by immunoblotting. 

f, Representative images of mCh-parkin-expressing pentaKOs from 

e immunostained for TOM20 (n = 3 experiments). g, Expression of GFP- 
TOLLIP in mCh-parkin pentaKOs. h, PentakOs mCh-parkin and with or 
without GFP-TOLLIP expression were immunoblotted. i, Representative 
images of mCh-parkin pentaKOs expressing GFP-TOLLIP immunostained 
for TOM20 (n = 3 experiments). Scale bars, 10 um. 
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Extended Data Figure 2 | OPTN, NDP52 and TAX1BP1 triple knockout 
analysis and disease-associated mutations. a, b, Knockout cell lines with or 
without mCh-parkin expression were immunoblotted (a) and COXI] levels 
were quantified (b). c, A panel of human tissue lysates was immunoblotted. 
d, Expression of wild-type or mutant GFP-OPTN in mCh-parkin pentakKO 
cells. e, Quantification of cells in f. More than 100 cells per condition. 

f, Representative images of mCh-parkin pentaKO cells expressing GFP-OPTN 
mutants immunostained for TOM20 (n = 3). g, PentaKOs expressing 
mCh-parkin were rescued with wild-type or mutant GFP-OPTN, analysed by 


immunoblotting. See Fig. 2e for quantification of COXII. h, Expression of wild- 
type or mutant GFP-NDP52 in mCh-parkin pentaKO cells. i, Quantification 
of cells in j. More than 100 cells per condition. j, Representative images of 
mCh-parkin pentaKOs expressing wild-type or mutant GFP-NDP52 were 
immunostained for TOM20 (n = 3). k, PentaKO cells expressing mCh-parkin 
rescued with wild-type or mutant GFP-NDP52 were analysed by 
immunoblotting. See Fig. 2f for quantification of COXII. Data are mean + s.d. 
from three (b and i) and two (e) independent experiments. ***P < 0.001 (one- 
way ANOVA). Scale bars, 10 um. 
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Extended Data Figure 3 | TBK1 activates OPTN in PINK1/parkin 
mitophagy. a, Representative images of untreated mCh-parkin cells and 
merged images of treated cells as indicated immunostained for DNA. See Fig. 2a 
for anti-DNA and DAPI images of treated samples (n = 3). b, Cell lysates from 
wild-type, NDP52/OPTN DKO, OPTN knockout and NDP52 knockout cells 
with or without mCh-parkin expression were immunoblotted for TBK1 
activation. c, Cell lysates from wild-type and PINK1 knockout cells without 
parkin expression were immunoblotted for TBK1 activation (S172 
phosphorylation). d, Confirmation of TBK1/NDP52 DKO, TBK1/OPTN DKO 


and TBK1 knockout by immunoblotting. e, Knockout cell lines from d were 
immunostained for TOM20 (n = 3). f, Representative images of untreated 
mCh-parkin wild-type and knockout cells, and merged images of treated cells 
as indicated were immunostained for DNA. See Fig. 2g for anti- DNA/DAPI 
images treated samples (n = 3). g, TBK1/NDP52 DKO cells rescued with GFP- 
TBK1 wild-type or K38M, or GFP-OPTN(S177D) and were assessed by 
immunoblotting. h, Cells in g were assessed by immunoblotting. 

i, Quantification of COXII levels in h displayed as mean + s.d. from three 
independent experiments. **P < 0.005 (one-way ANOVA). Scale bars, 10 um. 
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Extended Data Figure 4 | Parkin-independent recruitment of receptors to 
mitochondria through PINK1 activity. a, Isolated mitochondria from wild- 
type and pentaKO cells with or without FRB-Fis1] and with wild-type or 
kinase-dead PINK1A110-YFP-2FKBP were immunoblotted. 

b-e, Representative images of pentaKO cells expressing FRB-Fis1, wild-type or 
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Extended Data Figure 5 | PINK] directly stimulates mitophagy in the 
absence of mitochondrial damage. a, b, Cells were treated with rapalog and 
analysed by FACS for lysosomal-positive mt-mKeima. Representative data for 
wild-type HeLa (a) and pentaKO cells (b) without or with Flag/HA-OPTN. 
c, d, Cell lysates from pentaKO cells expressing FRB-Fisl, PINK1A110-YFP- 
2X FKBP, mt-mKeima and wild-type Flag/HA-OPTN (c) or mutants, (d) Flag/ 
HA-p62, wild-type Flag/HA-NDP52 or NDP52 mutants as indicated were 
assessed for receptor expression by immunoblotting. e, f, Cells from ¢ and 
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d were treated with rapalog and analysed by FACS for lysosomal-positive 
mt-mKeima. Representative data of two experiments are presented. g, Cell 
lysates from pentaKO cells expressing FRB-Fis1, with or without Flag/HA- 
OPTN and wild-type or kinase-dead PINK1A110-YFP-2FKBP were 
assessed for OPTN by immunoblotting. h, Flag/HA-OPTN pentaKO cells 
expressing FRB-Fis1, PINK1A110-YFP-2 FKBP, mt-mKeima-transfected 
and either vector or untagged parkin were analysed by FACS. Representative 
data of two experiments are presented. 
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Extended Data Figure 6 | PINK1 directly stimulates mitophagy after 
mitochondrial damage. a, c, Representative FACS data of mt-mKeima- 
expressing wild-type or PINK1 knockout HeLa cells (a) or PINK1 knockout 
cells rescued with PINK1 wild-type and then untreated or treated with 
oligomycin and antimycin A (c). b, d, Average percentage of mitophagy for two 
replicates of a and ¢, respectively. e, Representative images of wild-type 
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HeLa cells expressing mCh-OPTN and treated with oligomycin and antimycin 
A as indicated and immunostained for TOM20 (n = 3). f, Quantification of 
mCh-OPTN translocation from cells in e. Data are mean = s.d. from three 
independent experiments. ***P < 0.001 (one-way ANOVA). NS, not 
significant. 
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Extended Data Figure 7 | OPTN and NDP52 preferentially bind phospho- 
mimetic ubiquitin. a, HeLa cells expressing mCh-parkin and wild-type 
HA-ubiquitin (HA-UB) or mutants S65D or S65A were treated with CCCP. 
HA-ubiquitin was co-immunoprecipitated and the bound fraction was 
analysed by immunoblotting. Quantifications of the total bound fraction of 
OPTN, NDP52 and p62 are shown. b, HA-ubiquitin transfected into HeLa cells 
with mCh-parkin were then treated with CCCP, and HA-ubiquitin was 
immunoprecipitated. The bound fraction was treated with the deubiquitinase 
USP2, and washed to remove all unbound protein after deubiquitination. 
Quantification of the total bound fraction of OPTN, NDP52 and p62 are shown 


on the right. c, d, Strep-tactin-tagged ubiquitin (strep-UB) was incubated with 
either wild-type or kinase-dead PINK] in an in vitro phosphorylation reaction, 
immunoblotted with an anti-phosphoS65 ubiquitin antibody (c), and then 
incubated with cytosol collected from untreated, wild-type HeLa cells. The 
ubiquitin was then pulled down using strep-tactin beads and analysed by 
immunoblotting (d). e, Quantification of bound OPTN and p62 normalized to 
total ubiquitin. Data in a, b and e are mean + s.d. from three independent 
experiments. *P < 0.05, **P < 0.005, ***P < 0.001 (one-way ANOVA). 
Dagger symbol denotes nonspecific band. 
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Extended Data Figure 8 | Analysis of LC3 family members and their 
translocation to damaged mitochondria in autophagy receptor knockout 
cell lines. a, Representative images of wild-type, pentaKO and ATGS knockout 
HeLa cells expressing mCh-parkin and GFP-LC3B were immunostained for 
TOM20 (n = 3). b, Cell lysates from mCh-—parkin expressing wild-type, 
pentaKO and ATGS knockout cells were immunoblotted. c, Representative 
images of wild-type, NDP52/OPTN DKO and pentaKO cells expressing 
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mCh-parkin and GFP-tagged LC3A, LC3B or LC3C were immunostained for 
TOM20 (n = 3; see Fig. 4a for quantification). d, e, Representative images of 
wild-type and NDP52/OPTN/TAX1BP1 triple knockout cells expressing 
mCh-parkin and GFP-LC3C were immunostained for TOM20 (n = 3) (d) and 
quantified for GFP-LC3C translocation to mitochondria (e). Data in e are 
mean + s.d. from three independent experiments. ***P < 0.001 (one-way 
ANOVA). Scale bars, 10 um. 
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Extended Data Figure 9 |GABARAP proteins do not translocate to 
damaged mitochondria, and early stages of autophagosome biogenesis 
mediated by WIPI1 and DFCP1 are inhibited in autophagy-receptor- 
deficient cell lines. a—c, Representative images of wild-type, NDP52/OPTN 
DKO and pentaKO cells expressing mCh—parkin and GFP-tagged GABARAP, 
GABARAPLI or GABARAPI2 (a), GFP-WIPI1 (b) or GEP-DFCP1 (c) 
immunostained for TOM20 (n = 3 for each condition, see Fig. 4b and c 


for quantification of b and c). d, mCh-parkin cell lines as indicated were 
subjected to either phos-tag SDS-PAGE or standard SDS-PAGE followed by 
immunoblotting. Arrows indicate the position of phosphorylated beclin 
species. e, Representative images of untreated wild-type, NDP52/OPTN DKO 
and pentaKO cell lines expressing mCh-parkin and GFP-ULK1 were 
immunostained for TOM20 and GFP (n = 3 experiments). Scale bars, 10 pum. 
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Extended Data Figure 10 | OPTN and NDP52 rescue DFCP1 and ULK1 
recruitment deficit in pentaKOs. a, Representative images of pentaKO cells 
expressing mCh-parkin, GFP-DFCP1 and the indicated Flag/HA-tagged 
autophagy receptors immunostained for haemagglutinin (n = 2 experiments). 
Panels on the right display co-localization of Flag/HA-tagged constructs and 
GFP-DFCP1 by fluorescence intensity line measurement. b, Representative 
images of pentaKO cells expressing mCh-parkin and GFP-ULK1 that were 
rescued by Flag/HA-OPTN, Flag/HA-NDP52 and Flag/HA-p62, and 
immunostained for haemagglutinin and GFP. Arrows indicate HA-tagged 
receptor puncta (n = 2). Right panels display colocalization of haemagglutinin 
and GFP by fluorescence intensity line measurement. c, d, Representative 
images of pentaKO cells stably expressing FRB-Fis1 and transiently expressing 
PINK1A110-YFP-2FKBP and vector or Myc-tagged receptors, that were 


untreated (c) or treated with rapalog (d) and imaged live (nm = 3 experiments, 
see Fig. 4h, i for quantification of c, d). Scale bars, 10 1m. e, Old and new models 
of PINK1/parkin mitophagy. The old model is dominated by parkin 
ubiquitination of mitochondrial proteins. In this, PINK1 has a small initiator 
role with the main function being to bring parkin to the mitochondria. The 
new model depicts parkin-dependent and -independent pathways leading to 
robust and low-level mitophagy, respectively. On the basis of our data, PINK] is 
central to mitophagy both before and after parkin recruitment by 
phosphorylating ubiquitin to recruit parkin and autophagy receptors to 
mitochondria, to induce clearance. In the absence of parkin (right), this occurs 
at a low level owing to the relatively low basal ubiquitin levels on mitochondria. 
When parkin is present, it serves to amplify the PINK1 generated phospho- 
ubiquitin signal, allowing for robust and rapid mitophagy induction. 
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Structural insights into p-opioid 


receptor activation 


Weijiao Huang'*, Aashish Manglik'*, A.J. Venkatakrishnan!*?, Toon Laeremans*”, Evan N. Feinberg’, Adrian L. Sanborn'*?, 
Hideaki E. Kato!, Kathryn E. Livingston®, Thor S. Thorsen’, Ralf C. Kling’, Sébastien Granier®, Peter Gmeiner’, 
Stephen M. Husbands’, John R. Traynor®, William I. Weis!'°, Jan Steyaert*®, Ron O. Dror®** & Brian K. Kobilka’ 


Activation of the p-opioid receptor (uOR) is responsible for the efficacy of the most effective analgesics. To shed light on 
the structural basis for »OR activation, here we report a 2.1 A X-ray crystal structure of the murine »OR bound to the 
morphinan agonist BU72 and a G protein mimetic camelid antibody fragment. The BU72-stabilized changes in the »OR 
binding pocket are subtle and differ from those observed for agonist-bound structures of the B.-adrenergic receptor 
(B2AR) and the M2 muscarinic receptor. Comparison with active BAR reveals a common rearrangement in the packing 
of three conserved amino acids in the core of the pOR, and molecular dynamics simulations illustrate how the 
ligand-binding pocket is conformationally linked to this conserved triad. Additionally, an extensive polar network 
between the ligand-binding pocket and the cytoplasmic domains appears to play a similar role in signal propagation 


for all three G-protein-coupled receptors. 


The most powerful analgesic and addictive properties of opiate alka- 
loids are mediated by the WOR’. As the receptor primarily responsible 
for the effects of opium, the LOR is one of the oldest drug targets 
within the pharmacopeia’. Opioid receptors are highly versatile sig- 
nalling molecules. Activation of the OR results in signalling through 
the heterotrimeric G protein G,, resulting in analgesia and sedation as 
well as euphoria and physical dependence’. The 1OR can also signal 
through arrestin, and this pathway has been attributed to adverse 
effects of opioid analgesics including tolerance, respiratory suppres- 
sion, and constipation*®. 

The LOR has been the subject of intense focus for drug-discovery 
efforts over the past century, with the identification of numerous 
ligands of varying efficacy. These drugs occupy a wide chemical spec- 
trum, from small organic molecules to a variety of endogenous and 
synthetic peptides’. Structure-activity studies have revealed that 
subtle changes in ligand structure can convert an agonist into an 
antagonist’. These studies have yielded a general hypothesis for the 
information encoded within G-protein-coupled receptor (GPCR) 
ligands where distinct pharmacophores within a drug are responsible 
for efficacy (message) or selectivity (address)* (Fig. 1a). For the mor- 
phinan ligands, our previous structural examination of the inactive 
states of the OR and the OR revealed molecular insights into ligand 
selectivity”’®. To understand the structural basis for WOR activation, 
we obtained a structure of this receptor in the active state using a 
combination ofa high-affinity agonist and a G protein mimetic camelid 
antibody fragment. A comparison of this structure with the inactive- 
state structures of the pOR’ and 6OR!*"’, as well as the inactive- and 
active-state structures of the B,AR'?"°, M2 muscarinic receptor 
(M2R)'*”’, and rhodopsin’*”’, provide insights into shared mechan- 
isms of GPCR activation. 


Nanobody-stabilized structure of the »OR 


The active states of ligand-activated GPCRs are probably unstable, 
even when bound to full agonists*”*. However, the active conforma- 
tion can be stabilized by interactions between a receptor and its cog- 
nate G protein. This stabilization is reflected in a higher affinity for 
agonists when GPCRs are in complex with their cognate G protein”. 
In the case of the OR, the affinity for the morphinan agonist BU72 is 
enhanced by 47-fold when coupled to the G protein G; (Fig. 1b, c). 
Efforts to obtain a structure of activated LOR in complex with G; have 
thus far not been successful. As an alternative, we have previously 
used camelid single-domain antibody fragments (nanobodies) as G 
protein mimetics to stabilize the active conformation of the B,AR and 
M2R for structural study’*’*"’. For the BAR, the conformation of 
the receptor obtained in complex with the G, mimetic nanobody 80 
(Nb80) was nearly identical to that in the B,AR-G, complex” (root 
mean squared deviation, 0.61 A). 

To generate G protein mimetic nanobodies for the LOR, llamas 
were immunized with purified 41OR bound to the peptide agonist 
[Dmt']DALDA (ref. 26) and reconstituted into phospholipid vesi- 
cles”. We examined the ability of selected nanobodies to stabilize 
the high-affinity state for 1OR agonists. Purified 4.OR was reconsti- 
tuted into high-density lipoprotein (HDL) particles and agonist com- 
petition assays were performed in the presence or absence of 
nanobodies (Fig. 1b). In the presence of 5 4M nanobody 39 (Nb39), 
the affinity of the potent morphinan agonist BU72” increases from 
470pM to 16pM (Fig. 1b). BU72 has a dissociation half-life of 
140 min in the presence of Nb39 (Extended Data Fig. 1b). Nb39 also 
enhances the affinity of 4OR agonists DAMGO and endomorphin-2, 
indicating that the effect is not limited to morphinan agonists 
(Extended Data Fig. 1a). 
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Figure 1 | Activated structure of 1OR bound to BU72 and Nb39. 

a, Structures of prototypical opioid ligands highlighting regions involved in 
encoding efficacy (message) and selectivity (address). b, *H-diprenorphine 
(H-DPN) radioligand competition binding of pOR in HDL particles. In the 
presence of G;, the affinity of the morphinan agonist BU72 increases 47-fold. 
The two observed binding sites indicate the affinity of BU72 for receptor 
coupled to G; (Knignh) and uncoupled to G; (Kjow). A similar 29-fold increase in 
affinity is observed in presence of Nb39. The binding curves are representative 


Crystals of the LOR bound to BU72 and Nb39 were obtained in a 
monoolein lipidic mesophase** and a complete data set to 2.1 A was 
obtained by merging diffraction data from four crystals (Extended 
Data Table 1). Nb39 binds to the intracellular surface of WOR 
(Fig. 1d and Extended Data Fig. 2) and mediates the majority of 
packing interactions between lipidic layers and adjacent Nb39-WOR 
complexes in the crystal lattice. There are no packing interactions 
involving the extracellular surface of the receptor (Extended Data 
Fig. 1c). We observe a limited parallel dimeric packing interaction 
between OR molecules involving the extracellular end of transmem- 
brane helix 1 (TM1), TM2, the first extracellular loop (ECL1), and 
helix 8 with a buried surface area of 460 A” (Fig. le). A similar inter- 
face between TM1, TM2, and helix 8 was also observed in the inactive 
structure of HOR with a slightly larger buried surface area of 615 A” 
(Fig. le). The inactive structure also identified a more extensive par- 
allel dimer interaction involving TM5 and TM6 with a buried surface 
area of 1,460 A? (Fig. 1f). This interaction involving TM5 and TM6 
would not be compatible with the conformational changes we observe 
in the active state (Fig. 1f). It is important to note that the phys- 
iological relevance of these interfaces remains unclear. 

Structural differences in the extracellular surface between inactive 
and active LOR are relatively small (Fig. 2a) with the exception of the 
proximal N terminus, as discussed below. Conformational changes at 
the cytoplasmic surface of the L}OR observed upon activation are 
similar to those observed for the B.AR, M2R and rhodopsin, with 
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of at least three experiments performed in triplicate, and the data and error bars 
represent the mean ~ s.e.m. ¢, Structure of the high affinity agonist BU72. 

d, Overall structure of the tOR-BU72-Nb39 complex. e, An interface between 
TM1-TM2 and helix 8 (H8) is observed in both inactive and active structures 
of the LOR. The residues comprising the interface are highlighted in dark 
colours on the surface view. f, The TM5-TM6 interface observed for inactive 
LOR is not compatible with the active state due to clashing residues in TM5 and 
TM6 (highlighted in red). 


large outward movement of TM6 and a smaller inward movement 
of TM5 and TM7 (Fig. 2a and Extended Data Fig. 3). The conserved 
E/DRY motif at the intracellular end of TM3 plays a role in main- 
taining GPCRs in the inactive state. In rhodopsin, an ionic interaction 
between R135**? and E247°*° in TM6 stabilizes TM6 in an inactive 
conformation (superscript numbers follow the Ballesteros—Weinstein 
numbering method for GPCRs”’). While there is no acidic amino acid 
at the end of TM6 of OR that can form a similar salt bridge with 
R165°°°, R165°°° can form a hydrogen bond with T279°** (Fig. 2b). 
In the active states of the tOR and metarhodopsin II, R**° forms a 
hydrogen bond with Y°**, stabilizing the inward movement of TM5 
(Fig. 2b). R®*° and Y>°® assume a similar orientation in the M2R and 
the B,AR-G, complex; however, they are not close enough to form a 
hydrogen bond (Extended Data Fig. 3a). 


Agonist binding pocket 

The morphinan scaffold of BU72 binds to the activated WOR in a 
similar orientation to that observed previously for the irreversible 
antagonist B-funaltrexamine (B-FNA) at the inactive LOR (Fig. 3a 
and Extended Data Fig. 4). Fig. 3b shows that the overall structural 
differences in the orthosteric binding pockets of active and inactive 
HOR are relatively subtle. The majority of interactions between BU72 
and active OR are hydrophobic or aromatic in nature, with the 
exception of two conserved polar interactions (Fig. 3c and Extended 
Data Fig. 4). As observed previously in the inactive structures of the 
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Figure 2 | Structural comparison of inactive and active pOR. a, Active LOR 
undergoes a 10 A outward displacement of TM6 on activation. The 
extracellular domain of the receptor shows minimal changes upon activation. 
H8, helix 8. b, Left, comparison of the conserved E/DRY motif in the inactive 
structures of LOR (blue) and rhodopsin (Rho, brown) shows a polar interaction 
between R®°" and T®** in HOR, analogous to the ionic lock between R*°° 
and E°*° observed for rhodopsin. Right, comparison of the same region in the 
active state of 1tOR (green) and Rho (purple) shows a conserved interaction 
between R*°? and Y?**. 


HOR, SOR and KOR”, the phenolic hydroxyl of BU72 engages in a 
water-mediated interaction with H297°°*. In the active state, this 
network is more extended, and involves Y148*** and the backbone 
carbonyl of K233°*?. An ionic interaction between the morphinan 
tertiary amine and D147°*? is seen in both the active HOR bound to 
BU72 and in inactive HOR bound to B-FNA (Fig. 3c). 

We also observe an unexpected interaction between BU72 and the 
amino terminus of the OR, which forms a lid over the ligand-binding 
pocket (Fig. 3d and Extended Data Fig. 5). In particular, the amino- 
terminal residue H54 is positioned 2.6 A from secondary amine of 
BU72. It is of interest that binding of BU72, as well as other structur- 
ally unrelated peptide agonists like DAMGO and [Dmt']DALDA, 
lead to conformational changes in the truncated amino terminus that 
can be detected by nuclear magnetic resonance (NMR) spectroscopy 
(Sounier et al., ref. 31). While interactions between the amino ter- 
minus and the transmembrane core may indeed be important for 
uOR function”, the specific interaction observed here is unlikely to 
be physiologically relevant as H54 is not highly conserved and the 
H54A mutation does not alter the affinity of BU72 in OR with the 
full-length amino terminus (WOR wild-type Kj = 21 pM and OR 
H54A K;=30pM in HEK293 membranes). Additionally, using a 
bioluminescence resonance energy transfer assay’, we observe no 
significant difference in the ECs9 of G; activation by BU72 between 
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Figure 3 | The pOR agonist-binding pocket. a, BU72 and B-funaltrexamine 
(B-FNA) occupy a similar pose in the LOR binding pocket. The common 
morphinan scaffold shared by both ligands is highlighted. b, Binding-pocket 
residues of inactive (blue) and active (green) LOR viewed from the extracellular 
side. c, Polar interactions between BU72 and active HOR. d, BU72 and 
binding-pocket residues shown with the proximal amino terminus (grey). 

e, View of the polar cavity extending towards the intracellular side of 1OR in the 
active state. f, The cyclopropylmethyl group of the antagonist BU74 can be 
docked to fit within the polar cavity of the active state. However, molecular 
dynamics simulations show that this pose is unstable (Extended Data Fig. 6). 


the wild-type receptor and the H54A mutant (ECs, for wild type of 
79 + 17 pM and H54A mutant of 67 + 20 pM). 

While the morphinan core of BU72 is unambiguously placed 
within the electron density, there are two unexplained features of 
the ligand. We observe a strong positive electron density (F, — F. 
signal) between the H54 side chain and the BU72 amine (Extended 
Data Fig. 5c). Attempts to identify the source of this density, including 
mass spectrometry for an alternative ligand structure and the 
presence of an anomalous signal for a coordinated metal were unsuc- 
cessful (Extended Data Fig. 4). Another unexpected finding is the 
near-planar geometry of the sp” hybridized carbon of BU72 to which 
the pendant phenyl group is attached (Extended Data Fig. 4e-h). This 
geometry could be explained by a double bond between this 
carbon and the adjacent nitrogen. We observed a minor fraction of 
such a compound by mass spectrometry in our preparation of 
BU72. However, this compound was not observed in mass spectro- 
metry of the WOR-ligand-Nb39 complex used for crystallography. 
Ultimately, we modelled a higher-energy conformation of BU72 
within the observed electron density (see Extended Data Fig. 4 for 
further discussion). 

In the active LOR, there is a water-filled cavity lined by polar and 
aromatic side chains that extends off of the intracellular end of the 
ligand-binding pocket (Fig. 3e). While a similar cavity is observed in 
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inactive OR, the active-state cavity is larger and completely contigu- 
ous with the morphinan-binding site. Substitution of morphinans with 
a cyclopropylmethy] at the tertiary amine generally results in a ligand 
with antagonist activity’. As an example, BU74, which only differs from 
BU72 in having a cyclopropylmethy] substituent on the tertiary amine 
(Fig. 3f), is an antagonist at the tOR*™. While BU74 can be docked into 
the active state 1,OR structure with the cyclopropylmethyl within the 
polar cavity, there are potential clashes with Y326’** in TM7 and 
W293°** in TM6. Moreover, the cyclopropylmethyl substituent would 
displace one or more water molecules that form part of a polar network 
described in more detail below. Consistent with these observations, 
molecular dynamics simulations reveal that the antagonist BU74 is 
unstable in the position occupied by the agonist BU72 and rapidly 
shifts away from this initial pose (Extended Data Fig. 6). 


Propagation of conformational changes 


The structural difference in the orthosteric ligand-binding pocket 
between active and inactive OR is relatively subtle (Fig. 3b). It is 
difficult to identify specific interactions between the structurally rigid 
agonist and the receptor as being key for OR activation. This stands 
in contrast to activation of the B,AR and M2R, where specific polar 
interactions between the receptor and the smaller, more flexible ago- 
nists contribute to structural changes associated with activation. 
These polar interactions stabilize a 2A inward movement of TM5 
in the B,AR and a 2 A inward movement of TM6 in the M2R. 
While agonist-stabilized changes in the binding pocket differ for 
the LOR, BAR, and M2R, the overall structural changes observed at 
the G-protein interface are very similar (Extended Data Fig. 3). The 
mechanism by which agonist-stabilized changes propagate to the 
cytoplasmic surface appears to be more similar between OR and 
the B,AR compared to the M2R (Fig. 4a). For both the B,AR and 
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Figure 4 | Mechanisms of allosteric coupling in »OR. a, Comparison of the 
structural rearrangements in the conserved core triad of WOR, B2AR, and M2R. 
b, The morphinan ligands BU72 and B-FNA bind to the LOR with a shared 
hydrophobic surface shown in spheres. BU72 binding results in a 1.5 A 
displacement of TM3 towards TM2 and a rotameric change in the sodium 
coordinating residue N150°*°. Red arrows highlight displacement of the ligand 
or TM3 upon activation. c, In a molecular dynamics simulation initiated 
from the active OR structure but with the agonist BU72 removed from the 
binding site, residues in TM3, including the conserved core triad residue 
155°, adopt an inactive-like conformation. The motions of 1155°*° and 
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the WOR, there is a rearrangement of the packing of a triad of con- 
served amino acids F°**, P*°° and I?“° (which we term the conserved 
core triad) that lay just below the binding pocket (Fig. 4a). This 
rearrangement is associated with a counter-clockwise rotation (when 
viewed from the extracellular surface) and outward movement of 
the cytoplasmic end of TM6. Thus, the subtle agonist-stabilized 
rearrangement in the conserved core triad may initiate the cascade 
of structural changes involved in activation of the ,OR and BAR. The 
role of the conserved core triad appears to differ in the M2R, where 
the smaller V111°“° forms a weaker packing interaction with F396°* 
and P198”°°. Asa result, the triad in both inactive and active states of 
the M2R appear similar to the active states of the B.AR (Fig. 4a). 

Given the similarity in allosteric propagation between WOR and 
B.AR, we sought to identify how BU72 stabilizes the active conforma- 
tion of the conserved core triad. In the BAR, the rearrangement of the 
conserved core triad can be attributed to an agonist-stabilized inward 
movement of TM5'*”’. For the OR, however, there are no specific 
interactions between the receptor and BU72 that stabilize the inward 
movement of TM5. Instead, we identify a set of interactions that 
together appear to stabilize the rearrangement of the conserved core 
triad in the HOR. 

Both B-FNA and BU72 share a common morphinan scaffold. This 
common scaffold, however, is positioned differently in the inactive 
and active structures. In both states, residues 1296°°* and V300°°° in 
TM6 and W318’”° and 1322”? in TM7 form a common hydrophobic 
surface for binding the morphinan ligands (Fig. 4b, d, spheres). While 
this surface is similar in both inactive and active states, differences in 
the position of the rigid morphinan agonist and antagonist result in 
differences in the positions of key ligand substituents relative to spe- 
cific residues in TM3 and TM6 that are coupled to the conserved core 
triad of F289°**, P244°°° and 1155°“° (Fig. 4b, d). 
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ligand-contacting residue D147**” are tightly coupled throughout the 


simulation. Atom positions during simulation are plotted relative to the active 
structure, with positive values representing displacement towards the position 
in the inactive structure (see Methods). Dashed horizontal lines represent 

the positions of the indicated atoms in the inactive structure. d, w293°"? is 
slightly closer to the phenolic aromatic of the morphinan in the active state of 
HOR. e, Molecular dynamics simulations show that removal of the agonist 
BU72 from the active structure results in a change in the preferred rotamer of 
w293°48. Molecular dynamics results were consistent across multiple 
simulations; see Supplementary Information. 
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In the active OR structure, the morphinan scaffold of BU72 
adopts a pose that is sterically incompatible with the inactive position 
of TM3. Asa result, the residues of TM3 that interact with the tertiary 
amine (D147* 32) and methyl substituent (M151°*°) of the agonist 
shift 1.5 A towards TM2 relative to its position in the inactive struc- 
ture (Fig. 4b). We used molecular dynamics simulations to assess 
whether the agonist favours this displacement in TM3. In simulations 
of activated WOR with the agonist removed, the previously ligand- 
contacting residues on TM3 quickly relax towards their inactive posi- 
tions (Fig. 4c), often without global structural change of the receptor. 
These motions are tightly coupled to motion of the conserved core 
triad residue 1155°**° towards its inactive position, causing it to repo- 
sition relative to F289°** (Fig. 4b, c). Notably, the position of pD1477°? 
is also coupled to the rotameric state of N150°*°, which coordinates an 
allosteric sodium ion in the inactive state and forms a hydrogen bond 
with a backbone carbonyl of 1146?! in the active state (Fig. 4b, c). 

Another link between the ligand-binding pocket and the conserved 
core triad may be mediated via TM6 through W293°** (Fig. 4d, e). In 
the active OR structure, the aromatic group of BU72 is positioned 
only slightly closer (0.6 A) to W293°** as compared to the same 
aromatic group of B-FNA in the inactive state. However, molecular 
dynamics simulations suggest that agonists stabilize W293°** in the 
rotamer observed in the active-state crystal structure. In simulations 
of the active state, removal of BU72 results in a change in the favoured 
rotamer of W293°** (Fig. 4e). Conversely, we assessed whether an 
agonist bound to the inactive state would stabilize the side chain of 
W293°* in the rotamer associated with the active state. Here, we 
simulated the inactive-state structure with the antagonist B-FNA 
replaced with the agonist B-fuoxymorphamine (B-FOA), which dif- 
fers from B-FNA solely in a methyl substituent at the morphinan 
tertiary amine* (Extended Data Fig. 7a). In these simulations, the 
pose of B-FOA tends to shift towards that observed for BU72 and 
the TM3 residues D147**” and N150°*° shift towards their active- 
state positions; in concert, the side chain of W293° shifts towards its 
active position (Extended Data Fig. 7b, c). W293°"* is spatially juxta- 
posed to the conserved core triad residue F289°™*, and the conforma- 
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Figure 5 | Rearrangement ofa conserved polar network. a,b, Comparison of 
the water-mediated polar network in the active LOR (a) and high-resolution 
inactive OR (b) (PDB ID: 4N6H). To simplify comparisons between different 
receptors, only Ballesteros-Weinstein numbers are used to label amino acid 
side chains. The network extends from the orthosteric ligand-binding site to the 
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tion of W293°** may therefore serve as an important link between the 
ligand-binding pocket and the triad (Fig. 4d). 


Role of polar network in GPCR activation 

In addition to rearrangement of the conserved core triad, comparison 
of the inactive and active structures of WOR reveals an extensive 
network of polar interactions between the orthosteric binding pocket 
and the G-protein-coupling interface that must rearrange upon 
activation (Fig. 5a). The high-resolution electron density maps of 
the active WOR allow us to detect more ordered water molecules 
within this polar network than have been observed in other active- 
state GPCR structures reported to date. To provide a better compar- 
ison of this polar network, we examined the high-resolution inactive 
structure of the highly homologous OR". The inactive structure of 
d5OR at a resolution of 1.8 A identifies more water molecules within 
this polar network than the 2.8 A inactive structure of LOR and there- 
fore highlights the contribution of many hydrogen bonds in stabil- 
izing both the inactive and active states of opioid receptors (Fig. 5b). 
These hydrogen bonds represent many low-energy molecular 
switches that have to be broken and reformed in a concerted manner 
to achieve the active conformation. The polar network interactions for 
inactive-state LOR are likely to be identical to the inactive state of |OR 
because the specific amino acids involved in this network and their 
side chain conformations are identical between the two homologous 
receptors. 

Many of the residues in the polar network are conserved (Extended 
Data Fig. 8a, b, e, f), suggesting that the polar network may also be 
conserved and play a similar role in activation of other family A 
GPCRs. While there are fewer water molecules observed in the 
active-state structure of the BAR, the ones that are resolved are in 
the same positions as those observed in the WOR. Even though the 
resolution in the active B,AR and M2R structures is not sufficient to 
observe as many ordered water molecules, the positions of the con- 
served side chains lining the polar core of the B2,AR and M2R are 
nearly identical to those of the HOR (Extended Data Fig. 8f), suggest- 
ing that they are stabilized by a similar hydrogen-bonding network. 


Active WOR 
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G-protein-coupling domain of the receptor. c, The active structure of HOR 
reveals the basis for sodium ion allosteric regulation of GPCR function. 
Rearrangement of S**? and N**° eliminates the sodium ion coordination site in 
the active state. d, Conserved hydrogen-bonding network in the NPxxY region 
between active WOR and rhodopsin. 
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In the high-resolution inactive-state structure of the 5OR, there is 
an ordered sodium ion that is adjacent to the conserved core triad and 
is coordinated by the side chains of D*°°, N*3> and S*?, as well as by 
w°** through a water molecule (Fig. 5b, c). A sodium ion at a similar 
coordination site has been observed in inactive structures of the 
protease-activated receptor subtype 1 (PAR1)**, the adenosine Aza 
receptor (A>4R)”’ and the f, adrenergic receptor (B, AR)**. For many 
GPCRs, including the WOR, agonist-binding affinity and/or G-protein 
activation is allosterically inhibited by sodium”. Consistent with 
allosteric stabilization of the inactive state by sodium, we do not 
observe a sodium ion in the active-state structure of the HOR, and 
the residues that formed the sodium-binding site rearrange to pre- 
clude sodium binding in the active state (Fig. 5c). 

The polar network ends at the cytoplasmic surface with a web of 
interactions involving the conserved NPxxY sequence (Fig. 5d). Upon 
receptor activation, the NPxxY motif in TM7 moves inward towards 
TM5 where N332’*” and Y336’”° participate in a hydrogen-bond 
network involving Y252°°* in TM5, the backbone carbonyls of 
L158°8 in TM3 and V285°" in TM6, and three ordered water mole- 
cules (Fig. 5d). A similar hydrogen-bond network was previously 
observed in the structure of metarhodopsin II’’. Similar water- 
mediated hydrogen bonds are probably present in active B,AR and 
M2R as the side chains of key residues in this region occupy similar 
positions (Extended Data Fig. 3b). 

It is interesting to speculate on the role of this polar network in the 
energetics of HOR activation. NMR studies reveal that allosteric coup- 
ling of the agonist-binding pocket and the G-protein-binding inter- 
face in the OR is relatively weak, and structural changes in TM6 are 
observed only in the presence of a G protein mimetic nanobody 
(Sounier et al., ref. 31). A similar observation has been made for 
the B,AR*”***°. Using both double electron-electron resonance 
(DEER) spectroscopy and NMR spectroscopy, we observed that in 
the presence of a saturating concentration of the catecholamine ago- 
nist isoproterenol only 20% of B2AR has TM6 in an active-like con- 
formation™. This stands in contrast to the more efficient coupling 
between the orthosteric binding pocket and TM6 in rhodopsin, as 
reflected in the ability to crystallize rhodopsin in an active state with- 
out a G protein or a G protein mimetic nanobody™, and biophysical 
studies that reveal a stronger allosteric coupling between the orthos- 
teric binding pocket and TM6 in rhodopsin**. Comparison of the 
inactive-state structures of rhodopsin and the SOR reveals that the 
SOR has a more extensive polar network on the cytoplasmic side of 
the ligand-binding pocket (Extended Data Fig. 9). This is particularly 
notable when considering the network of hydrogen bonds that main- 
tain TM6 in the inactive conformation (Extended Data Fig. 9). As 
noted above, a similar polar network probably stabilizes the inactive 
states of the LOR and BAR (Extended Data Fig. 8). The less extensive 
polar network stabilizing the inactive state of rhodopsin is compen- 
sated for by the covalent inverse agonist, 11-cis-retinal. This balance 
of non-covalent polar interactions and a covalent ligand imparts rho- 
dopsin with virtually no basal activity, but the ability to rapidly and 
efficiently respond to photoisomerization of retinal. In contrast, the 
polar network in the LOR and BAR help to maintain unliganded 
receptors in an inactive state, but the need to disrupt this network 
makes activation of these receptors energetically less efficient. 


Conclusion 


The structure of activated WOR presented here provides a model for 
how efficacy is encoded within small chemical differences in otherwise 
structurally similar morphinan ligands. Comparison of active OR 
with other active-state structures offers insights into shared and 
subtype-specific mechanisms for the activation of family A GPCRs. 
Additionally, the extensive reorganization of the polar network 
required to achieve the fully active state may explain the inefficient 
allosteric coupling of the orthosteric pocket and the G-protein- 
coupling interface observed in NMR studies for both the hOR and 
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the BAR. It is possible that subtle ligand-specific differences in the 
polar network connections may contribute to preferential activation of 
different signalling proteins. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized. The investigators were not blinded to allocation during 
experiments and outcome assessment. 

Expression and purification of pOR. Full length Mus musculus WOR bearing an 
amino-terminal Flag epitope tag and a carboxy-terminal 6 X His tag was 
expressed in Sf9 insect cells using the BestBac baculovirus system (Expression 
Systems). To facilitate removal of flexible amino- and carboxy-terminal regions, a 
tobacco etch virus protease recognition sequence was inserted after residue 51 
and a rhinovirus 3C protease recognition sequence was inserted before residue 
359. Insect cells were infected with baculovirus encoding OR at a density of 
4 X 10° cells ml’ for 48-60 h at 27 °C. Receptor was solubilized and purified in a 
final buffer comprised of 25 mM HEPES pH7.4, 100mM NaCl, 0.01% MNG 
(Anatrace), and 0.001% cholesterol hemisuccinate (CHS), as previously 
described’. 

Llama immunization and selection of »OR-binding nanobodies. Purified HOR 
bound to the antagonist naloxone and with the amino and carboxy termini cleaved 
was incubated with an excess of the agonist [Dmt']DALDA (NIDA Drug Supply 
Program), and further purified by size-exclusion chromatography in a buffer com- 
prised of 20mM HEPES pH7.5, 100mM NaCl, 0.1% dodecylmaltoside (DDM, 
Anatrace), 0.03% CHAPS, 0.01% CHS, and 1 uM [Dmt']DALDA. The resulting 
agonist-bound receptor was reconstituted into phospholipid vesicles composed of 
DOPC (1,2-dioleoyl-sn-glycero-3-phosphocholine, Avanti Polar Lipids) and Lipid 
A in a 10:1 (w:w) ratio at a final receptor concentration of 1.3mgml_'. The 
resulting reconstituted receptor was flash frozen in liquid nitrogen in 100 jig ali- 
quots for llama immunization. 

One llama was immunized over a period of 6 weeks with 0.3 mg of liposome- 
reconstituted JOR purified as described above and bound to the agonist 
[Dmt']DALDA. A phage display library of nanobodies was prepared from peri- 
pheral blood lymphocytes as previously described’. 1OR-binding nanobodies 
were identified by selecting phages that bound liposome-reconstituted tOR in the 
presence or absence of agonist. Ten clones from three families were enriched 
during rounds of selection. One family, which includes Nb39, bind the intracel- 
lular surface and function as G-protein mimetics. Another family of nanobodies, 
including clone Nb35, was identified that competes directly with orthosteric 
antagonists of the OR and binds at the extracellular surface. 

Expression and purification of nanobodies. Nanobodies bearing a carboxy- 
terminal 6 X His tag were expressed in the periplasm of Escherichia coli strain 
WKCé. Cultures were grown to an ODgo9 of 1.0 at 37 °C in Terrific Broth medium 
containing 0.1% glucose, 2mM MgCl,, and 50 jig ml” * ampicillin and induced 
with 0.5 mM isopropyl-B-p-thiogalactoside (IPTG). Cells were harvested after 
overnight growth at 25°C and incubated in a buffer containing 200 mM Tris 
pH8.0, 0.5mM EDTA, 500mM sucrose and 0.5mg ml lysozyme for 1h at 
25 °C. Bacteria were osmotically lysed by rapid dilution in a 4X volume of water. 
The periplasmic fraction was isolated by centrifugation of cell debris, and was 
supplemented with NaCl to a final concentration of 150 mM as well as imidazole 
to a final concentration of 25 mM. Nanobodies were purified from the periplas- 
mic fraction by nickel affinity chromatography, and were subsequently purified 
by size-exclusion chromatography in a buffer comprised of 25 mM HEPES pH 7.5 
and 100mM NaCl. Peak fractions were pooled and concentrated to approxi- 
mately 5mM. 

Radioligand characterization of BU72 and Nb39. Radioligand competition 
assays were performed using purified ,OR reconstituted into high-density lipo- 
protein (HDL) particles comprised of the lipids POPC and POPG (Avanti Polar 
Lipids) in a 3:2 molar ratio as previously described***°. Heterotrimeric G; was 
prepared by co-expressing human G,«1, human GB1 and Gy2 subunits in Hi5 
insect cells using baculoviruses encoding the individual subunits. The G protein 
was purified as previously described”. 

For competition binding experiments, a mixture of 0.02 nM OR and 2.3 nM 
°H-diprenorphine (7H-DPN, Perkin Elmer) was incubated with varying concen- 
trations of agonist in a binding buffer comprised of 25mM HEPES pH 7.4, 
100 mM NaCl, and 0.1% BSA. Experiments were also performed with either 
51M Nb39 or 500nM G,. Binding reactions were incubated for 4h at 25°C. 
Free radioligand was separated from bound radioligand by rapid filtration onto 
a Whatman GF/B filter pretreated with 0.1% polyethylenimine with the aid of a 
48-well harvester (Brandel). Radioligand activity was measured by liquid scintil- 
lation counting. Competition binding data were fit to a one-site model for LOR 
alone and OR with Nb39 and a two-site model for LOR incubated with G; using 
GraphPad Prism 6.0. 

Dissociation studies for BU72 were performed using the method of Motulsky 
and Mahan“. 7H-DPN was diluted in an assay buffer comprised of 20 mM Tris, 
pH7.4, 150mM NaCl, and 0.05% BSA containing WOR in HDL particles with 


either vehicle or different concentrations of BU72 alone or in the presence of 
Nb39. Binding reactions were incubated at 25°C in the dark and nonspecific 
binding was determined in the presence of 101M naloxone. Aliquots of this 
binding reaction were removed at specified time points over the course of 2 or 
8 hand filtered through Whatman GF/C filters with the aid of a Brandel harvester. 
As above, radioligand activity was measured by liquid scintillation counting. 
Dissociation rates for BU72 were determined by fitting data in the ‘kinetics of 
competitive binding’ program in GraphPad Prism 6.02. For K1 and K2, rates of 
°H-DPN association and dissociation were determined through independent 
studies following the same method as above. 

Purification and crystallization of the »;OR-BU72-Nb39 complex. Initial crys- 
tals of a tOR-BU72-Nb complex diffracted to low resolution. In an effort to 
improve the specific activity of purified WOR, we used an extracellular binding 
nanobody, Nb35 during the purification of the receptor. Nb35 binds in the 
orthosteric pocket of WOR with a K; of 12nM. Purified ligand-free HOR, with 
the amino and carboxy termini cleaved was mixed with a 2.5X excess of Nb35, 
and the complex was isolated by Ni-NTA chromatography. Functional OR in 
complex with Nb35 was eluted in a buffer containing 25mM HEPES pH7.5, 
100 mM NaCl, 0.01% MNG, 0.001% CHS, and 250 mM imidazole. The agonist 
BU72 and Nb39 were then added in excess. In the presence of Nb39, BU72 has 
exceptionally high affinity for the WOR and displaces Nb35, resulting in the 
desired tOR-BU72-Nb39 complex. The hHOR-BU72-Nb39 complex was iso- 
lated by size-exclusion chromatography and concentrated to approximately 
50mgml ' for crystallization trials. 

For crystallization, the purified 1}OR-BU72-Nb39 complex was reconstituted 
into a 10:1 (w:w) mixture of monoolein and cholesterol (Sigma). The protein 
solution and lipid were mixed in a 1:1.5 ratio (w:w), and the lipidic cubic phase 
was attained using the two-syringe method”. Thirty nanolitres of the resulting 
mesophase was dispensed onto 96-well glass plates and overlaid with 500 nl of 
precipitant solution using a Gryphon LCP robot (Art Robbins Instruments). 
Crystals grew in a precipitant comprised of 15-25% PEG300, 100 mM HEPES 
pH7.0-7.5, 1% 1,2,3-heptanetriol, 0.5-1.0% polypropylene glycol P 400 
(Hampton Research) and 100-300 mM (NH4),HPOy,. Crystals were observed 
after 2 days and reached full size in 1 week. Crystals were harvested with mesh 
grid loops (MiTeGen) and flash frozen in liquid nitrogen. Loops were 
screened at the Advanced Photon Source GM/CA beamlines 23ID-B and 
23ID-D. Diffraction data were collected at a wavelength of 1.033A using a 
10m beam with fivefold attenuation and exposed for 0.5-1s. An oscillation 
width of 0.1-0.5° was used and diffraction images from four crystals were merged 
to create the final data set. 

Structure determination and refinement. Diffraction images were indexed, 
scaled, and merged by XDS* with statistics summarized in Extended Data 
Table 1. The structure was determined by molecular replacement in Phaser” 
with initial search models including inactive 1OR (PBD ID: 4DKL) and Nb80 
(PDB ID: 3P0G). The structural model was iteratively rebuilt in Coot” and 
refined using Phenix*’. Final refinement statistics are summarized in Extended 
Data Table 1. Molprobity* was used for structure analysis and indicated that 
96.1% of residues were within favoured Ramachandran regions with 3.9% in 
allowed regions. No residues were identified as Ramachandran outliers. Figures 
were prepared using MacPyMOL (Schrédinger, Inc.). 

System setup for molecular dynamics simulations. Simulations of the LOR were 
based on both the antagonist-bound inactive-state crystal structure (PDB ID: 
4DKL) and the agonist-bound active-state crystal structure described in this 
manuscript. Coordinates were prepared by first removing all non-ligand and 
non-receptor molecules except for the cholesterol neighbouring TM7 and for 
crystallographic water molecules near the receptor. For inactive LOR simulations, 
the T4 lysozyme was removed and acetyl and methylamide capping groups were 
placed on R263"? and E270''*. For active LOR simulations, the nanobody was 
removed. In both cases, Prime (Schrédinger, Inc.) was used to model missing 
side-chains, and capping groups were then added to the N and C termini of the 
receptor. Histidine residues were simulated as the neutral Ne tautomer. Other 
titratable residues were simulated in their dominant protonation state at pH7 
except for D1147°°, which was charged in inactive simulations and neutral in 
active simulations. A sodium ion was placed adjacent to D114*°° in inactive 
simulations. 

The OR was simulated in seven distinct conditions. These include: (1) the 
unliganded, inactive LOR, prepared by deleting the covalently bound, co-crystal- 
lized ligand, B-FNA, and adding a proton in its place to K233°9; (2) the inactive 
WOR with the co-crystallized ligand B-FNA; (3) the inactive WOR with agonist 
B-FOA (which does not bind covalently); (4) the unliganded, active WOR, pre- 
pared by deleting the co-crystallized ligand, BU72; (5) the active 1OR with the 
co-crystallized ligand BU72; (6) the active }OR with the co-crystallized ligand 
BU72, with the N-terminal peptide deleted; and (7) the active LOR with the 
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antagonist BU74, with the N-terminal peptide deleted. Simulations of the active 
HOR without an N-terminal peptide were prepared by deleting residues 52 
through 64 of the receptor. Simulations with B-FOA were prepared by docking 
B-FOA to the crystallographic pose of B-FNA. Simulations with BU74 were 
prepared by docking BU74 to the crystallographic pose of BU72 and rotating 
the torsion angle of the methylcyclopropyl group to agree with that of B-FNA’s 
methylcyclopropyl group. 

We performed three to six simulations per condition (Supplementary 
Information). Simulations in a given condition were initiated from identical 
structures, but with initial atom velocities assigned independently and randomly. 

It should be noted that in all liganded simulations, including those with B-FNA, 
B-FOA, BU72, and BU74, the ligand’s tertiary amine nitrogen was protonated 
and therefore the ligand was simulated as a cation. This is necessary for the ligand 
to form the conserved salt bridge with neighbouring D147***. 

Each of the resulting prepared j1OR structures was then aligned to the orienta- 

tions of proteins in membranes (OPM)”* entry for the inactive LOR using 
MacPyMOL (Schrédinger, Inc.). The LOR was modified with disulphide bridges 
and inserted into a hydrated, equilibrated palmitoyloleoylphosphatidylcholine 
(POPC) bilayer using the CHARMM-GUI interface***’. Sodium and chloride 
ions were added to neutralize the system, reaching a final concentration of 
approximately 150 mM. All simulations contained one ),OR embedded in a lipid 
bilayer with 160 POPC molecules. 
Simulation protocol and analysis. Each simulation was performed on two GPUs 
using the CUDA version of PMEMD (Particle Mesh Ewald Molecular Dynamics) 
in Amber14°* © with 2.5-fs time steps. Simulations were heated from 0 K to 100 K 
in the NVT ensemble and then from 100 K to 310 K in the NPT ensemble, both 
with 10.0 kcal mol! A~? harmonic restraints applied to the protein and to the 
lipids. Initial velocities were assigned randomly at the first heating step with 
Langevin dynamics. Subsequently, simulations were equilibrated in the NPT 
ensemble at 310K (controlled with a Langevin thermostat) with pressure of 
1 bar (controlled with anisotropic Berendsen weak-coupling barostat), with har- 
monic restraints on all protein atoms tapered off 1.0 kcal mol”! A’?ina stepwise 
fashion every 2.0 ns starting at 5.0 kcal mol! A~? to 0.0 kcal mol! Ace: for a 
total of 12.0ns of equilibration. Bond lengths to hydrogen atoms were con- 
strained using SHAKE. Production simulations were performed in the NPT 
ensemble at 310 K and 1 bar, using a Langevin thermostat for temperature coup- 
ling and a Monte Carlo barostat for pressure coupling, and were initiated from the 
final snapshot of the corresponding equilibration simulation. Non-bonded inter- 
actions were cut off at 9.0 A, and long-range electrostatic interactions were com- 
puted using the particle mesh Ewald (PME) method, with an Ewald coefficient B 
of approximately 0.31 Al and B-spline interpolation of order 4. The FFT grid 
size was 84 X 84 X 96 for all simulations. We performed a total of 27 simulations, 
which are summarized in the Supplementary Information. 

We used the CHARMM36 parameter set for protein molecules, lipid mole- 
cules, and salt ions, and the CHARMM TIP3P model for water; protein para- 
meters incorporated CMAP terms®'®. These parameters were assigned using the 
ParmEd implementation of Chamber, provided with AmberTools14. Ligand 
parameters were based on the results from the CHARMM ParamChem web 
server, version 0.9.7.1°, and incorporated specific modifications to partial 
charges. When ParamChem reported large penalties (errors) for estimated partial 
charges, better estimates were determined by submitting appropriately chosen 
fragments to ParamChem (see below). 

Parameterization of B-FNA, the co-crystallized ligand in the inactive OR 
structure, required additional steps due to the added complexity of its covalent 
bond with K233°*? on the receptor. To assign parameters to the rest of B-FNA, 
the molecule was fragmented as shown in Supplementary Information section 
“Parameterization” and then re-assembled. In addition, a molecule consisting of 
K233°*° bonded to the portion of B-FNA on the extracellular side of its morphi- 
nan scaffold was submitted to ParamChem, and the results were used to modify 
charges near the bond between the B-FNA carbon and Lys side-chain nitrogen. 
A modified version of the ParmEd program was used to form a bond between 
B-FNA and K233°*?, to parameterize that and neighbouring bonds, and to mod- 
ify neighbouring partial charges according to the derived parameters. The related 
ligand B-FOA (which does not bind covalently to the LOR) was divided into the 
same three fragments as B-FNA for parameterization, with the following mod- 
ifications: the bond between the ligand and K233°°? was cleaved; a C=C double 
bond was formed between the two carbons adjacent to this bond; and the methyl- 
cyclopropyl group was replaced with a methyl group. The co-crystallized ligand 
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of the active LOR, BU72, was similarly parameterized by splitting it into frag- 
ments, individually uploading the fragments to the ParamChem web server, and 
assembling the results together. The fragments can be found in the 
Supplementary Information. The ligand, BU74, was parameterized identically 
to BU72, except that its tertiary amine methyl group was replaced with a methyl- 
cyclopropyl group. 

Quantitative analysis of trajectories was conducted in both cpptraj, an 
Amber14 software package, and VMD®. Plots were smoothed with a moving 
average, with a symmetric triangular smoothing window. The window length was 
20ns, except during the first 10ns of each simulation, when the smoothing 
window extended from time zero to double the time at which the data was 
recorded. Plots were rendered in R with the ggplot2 package as well as in 
Python with the Matplotlib package. 

To compute movement of specified atoms from the active position towards the 
inactive position, as plotted in Fig. 4c, the active and inactive crystal structures 
and each simulation frame were aligned using the Co atoms of residues 80-95, 
102-13, and 181-205 (TMs 1, 2 and 4). For each specified atom, the atom position 
in simulation was projected onto the line connecting the atom positions in the 
active and inactive crystal structures, and the distance of the projected point from 
the position in the active crystal structure position was determined. 
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Extended Data Figure 1 | Characterization of Nb39 and lattice interactions 
in hOR-Nb39 crystals. a, °H-diprenorphine ?H-DPN) competition 
binding shows increased affinity for |tOR-selective agonists DAMGO and 
endomorphin-2 in the presence of Nb39. b, The dissociation half-life (t,/2) 

of BU72 was determined by measuring the association rate of the antagonist 
>H-DPN in the presence of the indicated concentrations of BU72. The 
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dissociation t1/. of BU72 is 43 min and increases to 140 min in presence of 
Nb39. Panels a and b are representative of at least three experiments performed 
in triplicate, and the data and error bars represent the mean + s.e.m. c, Crystal 
lattice packing of the HOR-Nb39 complex shows that most of the contacts 
are mediated by Nb39. The OR extracellular domain is not involved in any 
contacts. 
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Extended Data Figure 2 | 4OR-Nb39 interface. a, Nb39 does not penetrate framework residues. Notably, seven residues of CDR3 remained unresolved in 


as deeply into the core of the OR when compared with the BAR-Nb80 the final model of the }OR-Nb39 complex. b, Nb39 interacts primarily through 
complex and the M2R-Nb9-8 complex. In the B,AR-Nb80 and M2R-Nb9-8 _ hydrogen bonds with residues from ICL2, ICL3 and helix 8 of the LOR. 
complexes, nanobody CDR3 residues bind within the core of the receptor c, Schematic representation of the interactions between HOR and Nb39 
transmembrane bundle. In comparison, Nb39 binding involves more highlighting the numerous Nb39 framework interactions. 
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Extended Data Figure 3 | Cytoplasmic domain rearrangements in 
conserved regions. a, The E/DRY sequence is a highly conserved motif within 
family A GPCRs responsible for constraining receptors in an inactive 
conformation. Comparisons of inactive- and active-state structures around the 
conserved E/DRY residues at the cytoplasmic surface of the LOR, the M2 
muscarinic receptor (M2R), the B, adrenergic receptor (B,AR) and rhodopsin 
(Rho) are shown here. Hydrogen bonds are shown as dotted lines. b, The 
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NPxxY motif is a highly conserved sequence in TM7 among family A GPCRs. 
In the active state WOR, Y’>3 and N’” in TM7 interact with Y°>* in TM5 and 
the backbone carbonyl of L*’ in TM3 through a water-mediated polar 
network. A similar network is observed in the active state of rhodopsin. While 
waters are not observed in the lower-resolution structures of the BAR and 
M2R, the positions of the side chains of y73 N7*? and Y>® suggest a similar 
water-mediated network with putative waters represented by red circles. 
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Extended Data Figure 4 | Conformation of the binding pocket and BU72. 
The 2F, — F, electron density contoured at 2.06 and within 1.8 A of residues 
comprising the active LOR ligand-binding pocket is shown as grey mesh in 

a and b. The same views are shown in c and d with the omit F, — F, density for 
BU72 displayed as an orange mesh. Displayed F, — F- electron density is 
contoured at 3.0c. e, Placement of an energetically minimized conformation of 
BU72 within the F, — F, electron density shows a poor fit for the pendant 
phenyl ring. The conformation of BU72 was minimized using quantum 
mechanical Hartree-Fock methods. f, An alternative possible ligand structure 
with sp” geometry at the carbon adjacent to the phenyl (highlighted in red 
dashed circle) was initially considered due to a better fit within the electron 
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density. This alternative ligand is predicted to be 2 Da smaller than BU72. 

g, In order to resolve potential ambiguity in the co-crystallized ligand, we 
performed mass spectrometry on the same protein sample used to generate 
crystals of the active LOR. The protein was precipitated in methanol and the 
supernatant was subjected to MALDI-MS which revealed a strong peak at 
m/z = 429.226, consistent with the expected mass of BU72. h, Shown is our 
final crystallographic model for BU72 within the F, — F, electron density. This 
model probably represents a high-energy conformation of BU72. Notably, the 
position of the morphinan scaffold is invariant between these alternative 
models for the crystallized ligand. 
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Extended Data Figure 5 | The N terminus of the OR interacts with BU72. _ contoured at 1.0¢ for the N terminus. c, Shown in green mesh is the F, — F. 
a, Surface cut-away view showing that the N terminus forms a lid over the omit map contoured at 4.00 of an unidentified density between BU72 and 
ligand-binding pocket. Shown in the lower panel is the ligand-binding pocketin — His54. 

the absence of the N terminus. b, Blue mesh shows the 2F, — F. omit map 
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Extended Data Figure 6 | Molecular dynamics simulation of active »OR 
bound to antagonist BU74. a, Structures of agonist BU72, and antagonists 
BU74 and B-funaltrexamine (fB-FNA). The inactive-state structure of 1OR was 
co-crystallized with B-FNA. b, BU74 was docked into the active-state structure 
of the OR based on the crystallographic pose of BU72, but in a molecular 
dynamics simulation it rapidly moves away from this initial pose. The middle 
panel highlights the movements of BU74 after 560 ns of simulation and the 


a a. 


Inactive LOR crystal structure 


Active HOR crystal structure 


rightmost panel shows the comparison of the BU74 pose as compared to the 
crystal structure of B-FNA bound to inactive OR. c, Molecular dynamics 
trajectory measuring the distance between the phenolic hydroxyl of Y326’* 
and the tertiary amine of BU74. Dotted lines show the distance between 
326’? and the same amine of BU72 in the crystal structure of active }OR and 
B-FNA in the structure of inactive OR. 
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Extended Data Figure 7 | Molecular dynamics simulation of inactive 

HOR bound to agonist B-FOA. a, Structures of agonists BU72 and 
B-fuoxymorphamine (B-FOA) and antagonist }-funaltrexamine (B-FNA). 

b, Molecular dynamics simulation of inactive 1OR with B-FOA docked into 
the same pose as B-FNA in the inactive-state crystal structure of HOR. B-FOA 
shifts towards TM3 with an accompanying rearrangement of TM3 
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residues D147*** and N150°*° towards the active-state structure. The overall 
ligand-binding pocket resembles the active state after 455 ns of simulation. 

c, Trajectory of the W293°"8 y, dihedral angle (indicated in the middle 
panel in b) over 700ns of simulation. In the presence of B-FOA, the 
preferred rotamer for W293°** rapidly approaches a conformation similar to 
the one observed in the structure of active WOR bound to BU72. 
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a Polar network residues in inactive states 
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Extended Data Figure 8 | Comparison of polar networks involved in GPCR 
activation. a, Residues involved in the polar network in the inactive state of 

the SOR (PDB ID: 4N6H) and conservation of those residues in B,AR, M2R, 

and rhodopsin. b, Residues involved in the polar network in active state 

HOR and conservation in BAR, M2R, and rhodopsin. c, Water-mediated polar 
network in the inactive structure of the OR involves residues from TM1, TM2, 


Inactive M2R 
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b Polar network residues in active states 
BW numbering HOR BAR M2R Rhodopsin 
1.50 86N 51N 41N 55N 
2.43 1071 721 62L 76L 
2.50 114D 79D 69D 83D 
3.32 147D 113D 103D 117A 
3.39 154S 120S 110S 
3.43 158L 124L) 114L 128L 
5.58 252Y 219Y 206Y 223Y 
6.40 285V 278) 3921 257M 
6.48 293W 286W 400W 265W 
7.38 3221 312N 426Y 292A 
7.41 325G 315G 429C 295A 
7.45 328N 318N 432N 298S 
7.46 3829S 319S 433S 
7.49 332N 322N 436N 302N 
7.53 336Y 326Y 440Y 306Y 
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TM3, TM5, TM6 and TM7. d, An identical view as in c of the polar network 
in the active LOR. e, Residues involved in the polar network in inactive 
structures of 5OR, B,AR and M2R are conserved both in sequence and 
conformation. f, In active HOR, B,AR and M2R, the residues within the polar 
network are again conserved in sequence and conformation. 
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Extended Data Figure 9 | Differences in TM6 polar network in opioid TM6 showing polar contacts. Notably, the inactive }OR engages in many more 
receptors and rhodopsin. a, The entire set of contacts within the polar network _ polar contacts with neighbouring residues as compared to inactive rhodopsin. 
that include a residue within TM6 is displayed for the inactive 5OR, active LOR, Additionally, the active states of both LOR and rhodopsin have fewer polar 
and inactive and active rhodopsin (Rho). b, Helix wheel representation of contacts than the inactive state. 
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Extended Data Table 1 | Data collection and refinement statistics (molecular replacement) 


OR-BU72-Nb39"* 


Data collection” 
Space group 
Cell dimensions 

a, b, c (A) 

a, B, y ©) 
Resolution (A) 
Rmmerge (%) 
<I/ol> 
CCi2 (%) 
Completeness (%) 
Redundancy 


Refinement 
Resolution (A) 
Number of reflections 
Ryvork/Rfree (%) 
Number of atoms 
Protein 
Ligand (BU72) 
Lipid,water and others 
B-factors (A’*) 
Protein 
Ligand (BU72) 
Lipid,water and others 
R.M.S. deviation from ideality 
Bond lengths (A) 
Bond angles (°) 
Ramachandran statistics® (%) 
Favored 
Allowed 
Outliers 


12,22) 


44.4, 144.0, 209.9 
90.0, 90.0, 90.0 
50.0-2.07 (2.12-2.07) 
11.2 (153.6) 

11.0 (1.7) 

99.2 (64.7) 

99.8 (100.0) 

8.6 (8.8) 


42.5-2.1 (2.15-2.10) 
39,948 
18.53/22.15 (25.83/28.71) 


3,278 
32 
208 


56.89 
39.50 
69.91 


“Diffraction data from 4 crystals were merged into a complete data set 
° Highest resolution shell statistics are shown in parentheses 


* As calculated by Molprobity 
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Growing the gas-giant planets by the gradual 


accumulation of pebbles 


Harold F. Levison!, Katherine A. Kretke! & Martin J. Duncan? 


It is widely held that the first step in forming gas-giant planets, 
such as Jupiter and Saturn, was the production of solid ‘cores’ each 
with a mass roughly ten times that of the Earth'’. Getting the cores 
to form before the solar nebula dissipates (in about one to ten 
million years; ref. 3) has been a major challenge for planet forma- 
tion models**. Recently models have emerged in which ‘pebbles’ 
(centimetre-to-metre-sized objects) are first concentrated by aero- 
dynamic drag and then gravitationally collapse to form objects 100 
to 1,000 kilometres in size*°. These ‘planetesimals’ can then effi- 
ciently accrete left-over pebbles’ and directly form the cores of 
giant planets'’'*. This model is known as ‘pebble accretion’; theor- 
etically, it can produce cores of ten Earth masses in only a few 
thousand years'’*. Unfortunately, full simulations of this pro- 
cess’* show that, rather than creating a few such cores, it produces 
a population of hundreds of Earth-mass objects that are inconsist- 
ent with the structure of the Solar System. Here we report that this 
difficulty can be overcome if pebbles form slowly enough to allow 
the planetesimals to gravitationally interact with one another. In 
this situation, the largest planetesimals have time to scatter their 
smaller siblings out of the disk of pebbles, thereby stifling their 
growth. Our models show that, for a large and physically reas- 
onable region of parameter space, this typically leads to the forma- 
tion of one to four gas giants between 5 and 15 astronomical units 
from the Sun, in agreement with the observed structure of the Solar 
System. 

Our models consist of a series of computer simulations that follow 
the evolution of a population of objects in a disk around the Sun. The 
solar composition disk has surface density distribution 2 oc ray 
where ray is the distance from the Sun, and consists of both gas and 
solids. We assume that an initial population of planetesimals, which 
follows the surface density of the disk, forms quickly and thus exists at 
the beginning of our simulations. These planetesimals contain only a 
small fraction of the mass of solids available (a free parameter of our 
models). Pebbles either also existed at the beginning of the simulation, 
or are allowed to form over some period of time (again, a free para- 
meter) starting at the beginning of the calculation. The evolution of 
this system is followed numerically and includes the effects of gravita- 
tional interactions, interactions between bodies in the disk and the gas 
(although nebular tidal migration is neglected), accretion (including 
enhancements due to the aerodynamic drag on pebbles; see, for 
example, ref. 11), and collisional fragmentation. Details of our setup 
and numerical techniques are described in Methods. 

In Fig. 1 we present the results of two different simulations that 
employ the same parameters except for the method of pebble forma- 
tion. In the first (Fig. 1a), we assume that the pebbles are left-overs of 
planetesimal formation and thus they exist at the beginning of the 
simulation. Note the fast accretion times—embryos (defined to be 
objects that grow significantly) evolved from having roughly the mass 
of Pluto to that of Earth in only 1,000 years. However, even though 
Earth-mass (Mg) embryos form quickly, this simulation does not 
reproduce the Solar System because rather than forming a few cores, 


it creates ~ 100 objects each of about 1 M@. These objects subsequently 
scatter one another to high-eccentricity, high-inclination orbits, stal- 
ling growth. This result is clearly inconsistent with observations. 
Reference 13 finds that this is a generic outcome for this type of pebble 
accretion simulation, and thus it could not have occurred in the Solar 
System. It is indicative of the results one would achieve if the dynamical 
interactions between planetesimals were neglected (which we refer to 
as the standard model). 

In Fig. 1b, we employ a substantial modification of the pebble accre- 
tion theory. We couple an extended timescale for pebble formation (a 
conjecture supported both by the observed presence of centimetre- to 
millimetre-sized grains, which are expected to drift rapidly out of 
protoplanetary disks, in disks of a wide variety of ages’* and by theor- 
etical models of pebble formation’’) to the dynamical stirring due to 
planetesimal self-gravity (so-called viscous stirring’®). The behaviour 
of this simulation is radically different from that in Fig. la in that, 
rather than producing a large population of objects each of 1 Me, afew 
giant planets form. Only 5 objects grow to 1 Mg or larger in this 
simulation, and there are two gas giants at 10 Myr. Also, note that 
the timescale for growth is very different in the two calculations. The 
first object of 1 Mg grows in 1,000 years in the standard pebble 
accretion run, as opposed to taking slightly over 400,000 years in this 
simulation. In the latter case, the growth time is determined by the rate 
at which pebbles were created. 

The outcomes of the two simulations are remarkably different 
because in the Fig. 1b simulation only the most massive embryos are 
able to accrete a significant amount of pebbles. This is due to the fact 
that the embryos grow slowly enough in this model that they can 
gravitationally interact with one another as they accrete the pebbles. 
This viscous stirring has two effects on the accretion rates of the 
embryos. First, it increases the relative velocities between the embryos 
and the pebbles and thus decreases the capture cross-section (equation 
(4) in Methods) of the embryos. Second, and more importantly, 
encounters between embryos lead to increases in the inclinations, i, 
of the orbits of the embryos and thus the distances they travel above 
and below the disk midplane. Once the inclinations of the embryos 
become larger than that of the aerodynamically damped pebbles, the 
embryos spend much of their time above or below the location where 
the bulk of the pebbles lie. This effectively starves them and stifles their 
growth. Owing to the role of viscous stirring in determining the out- 
come in these simulations, we refer to this process as viscously stirred 
pebble accretion. 

Figure 2 illustrates this effect. In the standard pebble accretion 
model (Fig. 2a), the system inclinations remain small and thus all 
the planetesimals can grow. This is because the timescale for viscous 
stirring is longer than that of the growth. In this case it can be shown 
(see Methods) that the growth rate of an embryo of mass M, can be 
described by dM,/dtoc M4 with f < 1, where t is the time since the 
start of the simulation. The small value of / implies that smaller 
embryos can catch up with larger ones, leading to a population of 
like-sized objects. 
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Figure 1 | The cumulative mass distribution of planetesimals and embryos. 
The growth of planetary embryos (of mass M) in our simulation as illustrated 
by the number of objects (N) greater than a given mass shown at various times 
(indicated by colour; see key at right). a, An example of standard pebble 
accretion—all pebbles were in existence at the beginning of the simulation. This 
is similar to ref. 13. b, An example with the same disk parameters as a but where 
the pebbles formed over the lifetime of the disk. This system formed two gas 
giants and three icy planets with masses between 2 Mg and 4 Mg. The icy 
planets are on crossing orbits at the end of the simulation (10 Myr) and thus 
their number and masses would be likely to change were the calculation run to 
completion. A video version of this figure is available (Supplementary Video 1). 


However, the dynamical evolution of the system is very different in 
the viscously stirred pebble accretion simulation. Here viscous stirring 
can act, thereby leading to an increase of inclinations. The magnitude of 
this increase is set by the mass of the largest embryo—note the increase 
with time of both the inclination of the smaller embryos and the largest 
embryo mass in Fig. 2b. Initially the inclinations of the embryos are 
smaller than those of the pebbles, but by 3,000 years, most of the smaller 
embryos begin to spend a significant amount of time above or below the 
pebbles. 

The time at which embryos are excited out of the swarm of pebbles 
can be estimated with a simple calculation. The inclinations of the 
pebbles are set by the balance between turbulent excitation and aero- 
dynamic drag in the disk. For our example, the average pebble inclina- 
tion is 0.0016 rad. Given enough time, a population of embryos will stir 
one another to the point where their relative velocities (v,.)) are com- 
parable to their surface escape velocities. Since inclination i ~ V;e/Vo, 
where vy, is the circular velocity, even if we assume objects do not 
grow, our population of planetesimals should reach inclinations of 
~0.06 rad—significantly larger than that of the pebbles. As a result, 
this effect should be important during much of our simulation. Indeed, 
Fig. 2b shows that most of the embryos in the system are in this state 
after only ~3,000 years. 

The most massive embryos, however, behave differently. They 
tend to have low inclinations (Fig. 2b) as a result of a combination 
of gravitational interactions with smaller planetesimals (so-called 
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Figure 2 | The vertical distribution of pebbles and embryos. A comparison 
between the vertical distribution, as represented by tan(i), of pebbles and 
embryos in the simulations shown in Fig. 1. The embryos are shown as dots in 
the figure, where their colour and shape indicates the time within the 
simulation (see key at top right). Since objects grow at different times within the 
disk, we only show embryos from a narrow annulus (11.5-13.5 Au). The time 
averaged cumulative inclination distribution of the pebbles is shown by the 
cyan gradient (colour key at far right). a, The run shown in Fig. 1a; b, our 
fiducial viscously stirred pebble accretion simulation from Fig. 1b. A video 
version of this figure is available (Supplementary Video 2). 


dynamical friction'®'’) and with the gas (so-called type I inclination 
damping’*). As a result, the larger embryos grow relatively quickly 
while the smaller embryos grow very slowly, if at all. Recall that the 
standard model of pebble accretion formed a large number of planets 
because f <1, which allowed smaller embryos to catch up with larger 
ones. Figure 3 shows the relationship between dM./dt and M, in our 
fiducial viscously stirred pebble accretion run as the system evolves. At 
early times, f is slightly larger than 1, leading to a few embryos becom- 
ing dominant in the population. When the largest objects in the system 
have masses between ~8 X 10°* Mg and ~0.02 Mg, their equilib- 
rium inclinations decrease, leading to a spurt of growth where f is large 
(~4). This allows a small number of embryos to become separated in 
mass from the rest, and explains why only a small number of embryos 
become massive enough to become giant-planet cores. Indeed, for the 
small embryos, dM,/dt decreases with time because of their increasing 
inclinations (see Fig. 2). But for M, = 0.02Mg, fh becomes small again 
as the strongly damped, large embryos accrete in the same manner as 
in the standard pebble accretion scenario. This last phase is important 
because it forces these proto-cores to have similar masses as they grow. 
Thus, they all reach the mass where they can directly accrete gas at 
roughly the same time. This last phase might also explain why the four 
cores of the giant planets probably originally had similar masses. Two 
gas giants grow in our fiducial simulation (perhaps Jupiter and Saturn). 
By varying parameters in the models, our systems produced between 0 
and 4 gas giants. Results of the simulations can be found in Extended 
Data Tables 1 and 2. 
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Figure 3 | Embryo growth rate as a function of mass. The temporal evolution 
of the relationship between embryo growth rate (dM./dt) and mass (M.,). The 
dots show our fiducial viscously stirred pebble accretion simulation (Fig. 1b); 
their colour and shape indicate the time within the simulation (see key at 
bottom right). The solid curve is from the standard pebble accretion run in 
Fig. 1a. These values were calculated using procedures described in Methods. 
The behaviour of the system is determined by the slope of the data in the figure, 
B (note that this is a log-log plot), at each time. For reference, the dotted lines 
show f = 1 and f = 4. For f < 1, small embryos can catch up with larger ones 
leading to a population of like-sized objects. For f > 1 the largest embryos run 
away from their smaller siblings. See Methods for more details. 


There is another constraint that any model of giant-planet 
formation in the Solar System must satisfy in order to be considered 
a success. The distribution of small bodies in the outer Solar System 
indicates that the orbits of the giant planets moved substantially after 
they formed’*". In particular, Uranus and Neptune are likely to have 
formed within 15 or 20 au of the Sun and to have been delivered to 
their current orbits by either a smooth migration” or a mild gravita- 
tional instability’. Both processes require that a population of planet- 
esimals existed on low-eccentricity orbits beyond the giant planets 
after the planets finished forming. This population must have (1) 
not formed planets, and (2) survived the planet formation process 
relatively unscathed. To evaluate this constraint in our simulations, 
we placed 5 planetesimals with radii similar to Pluto (s = 1,350 km) 
on circular, co-planar orbits with semi-major axes between 20 and 30 Au. 
None of these objects grew in our fiducial simulation, and they all 
survived on orbits with eccentricities less than 0.07 (see Methods 
for how this behaviour varies with disk parameters). Therefore, the 
process of viscously stirred pebble accretion reproduces the 
observed structure of the outer Solar System: two gas giants, a few 
icy planets, and a disk of planetesimals into which the ice giants can 
migrate. 
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and Source Data, are available in the online version of the paper; references unique 
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METHODS 


Computational method. For this work we have employed a particle-based 
Lagrangian code (known as LIPAD, Lagrangian Integrator for Planetary 
Accretion and Dynamics) that can follow the dynamical/collisional/accretional 
evolution of a large number of planetesimals through the entire growth process to 
become planets. For full details about the code and extensive test suites, see ref. 22. 
For details about how pebble accretion is implemented in the code see ref. 13, but 
we summarize the most relevant attributes of the code here. 

LIPAD is built on top of the N-body integrator SYMBA”™. In order to handle the 
very large number of sub-kilometre objects required by many simulations, LIPAD 
utilizes a concept known as tracer particles. Each tracer represents a large number 
of small bodies with roughly the same orbit and size, and is characterized by three 
numbers: the physical radius, the bulk density, and the total mass of the disk 
particles represented by the tracer. LIPAD employs statistical algorithms that 
follow the dynamical and collisional interactions between the tracers. When a 
tracer is determined to have been struck by another tracer, it is assigned a new 
radius according to the probabilistic outcome of the collision based on a frag- 
mentation law by ref. 24, using the ice parameters Q)=7X 10’ ergg ', 
B= 2.1 ergcem’® g *, a= —0.45 and b = 1.19. This way, the conglomeration of 
tracers represents the size distribution of the evolving planetesimal population. In 
this work, we do not allow our pebbles to collisionally grow or fragment, therefore 
particles below 1 km in size are not involved in the collisional cascade. 

LIPAD also includes statistical algorithms for viscous stirring, dynamical fric- 
tion, and collisional damping among the tracers. The tracers mainly dynamically 
interact with the larger planetary mass objects via the normal N-body routines, 
which naturally follow changes in the trajectory of tracers due to the gravitational 
effects of the planets and vice versa. LIPAD is therefore unique in its ability to 
accurately handle the mixing and redistribution of material due to gravitational 
encounters, including gap opening and resonant trapping, while also following the 
fragmentation and growth of bodies. Thus, it is well suited to following the evolu- 
tion of a population of embryos, planetesimals, and pebbles while they gravita- 
tionally interact to form planets. 

The pebble accretion model follows the prescription in ref. 10 and is described 
in detail in ref. 13. An overview of the physics follows in the next section. 

In the simulations presented here we do not allow the growing planets to 
migrate via type I migration’®, although we do include type I eccentricity damp- 
ing’. We also neglect type II migration. We calculate the aerodynamic drag on all 
bodies using the formalism of ref. 26. Additionally, as we are interested in the gross 
evolution of a system after the formation of a potential giant-planet core, we have 
added a simple optional prescription allowing cores to accrete gas, envelopes. 
In order to accrete gas, the core size must be above a critical value, which depends 
on the mass accretion rate of solids onto the core. We follow ref. 27 to determine 
when core masses are above the critical value given their current mass accretion 
rate (assuming a grey opacity of 0.02 times that of the interstellar medium”). If this 
criterion is met then we grow the planet of mass M, on the Kelvin-Helmholtz 


timescale (tx1) by accreting gas at a rate of M. g» So that: 
: M, 
Mg= — (1) 
KH 


Following ref. 28 we approximate this timescale as 


M.\73 
tk =10°( —2 2 
wn=10(T) yr 0) 
We limit gas accretion to the Bondi accretion rate, 
242 
41 p,G M; 


(3) 


g.max = 3 
where pg is the gas density and c, is the local sound speed. We note that a major 
uncertainty in these models is the envelope opacity, which is dominated by the 
poorly constrained properties of the dust. Varying the opacity dramatically alters 
both the size of the critical core mass, and perhaps more strikingly, the Kelvin- 
Helmholtz time. We make two major simplifications in our procedures. First, we 
arbitrarily cut off gas accretion when a planet reaches one Jupiter mass instead of 
including any physics to reduce the accretion of gas after gap opening (see, for 
example, ref. 29). Furthermore, we do not include the fact that pebbles will cease 
accreting onto planetesimal cores once the cores begin perturbing the disk*”. This 
effect may prove important in causing some cores to accrete gas sooner, or in 
allowing cores to grow to larger sizes without accreting gas (creating true Uranus 
and Neptune analogues). Therefore the end masses of giant planets should be 
viewed with these caveats in mind. 

We also note that while this model includes fragmentation, we have found that 
the production, and subsequent sweep-up of pebbles (as considered by ref. 31) is 
relatively unimportant in these scenarios. This is due to the assumed relatively low 
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initial mass of planetesimals in our simulations (17 Mq@ between 4 Au and 15 avin 
our fiducial simulation). 

Pebble accretion. Here we present an argument, based on refs 10, 11, about why 
pebbles are effectively accreted by growing planets. If the stopping time (t,) of a 
pebble is comparable to the time it takes to encounter a growing embryo, then it 
can be deflected out of the gas stream and lose enough orbital energy to become 
gravitationally bound to the embryo. After capture, the pebble spirals inward due 
to aerodynamic drag and is accreted. In this case the collisional cross-section for 


accretion is 
4GM.t, \ P tv, \? (4) 
Ope = ex] 
a Veal e 4GM, 


where M, is the mass of the embryo, v,.; is the relative velocity between the pebble 
and embryo, G is the gravitational constant, and y = 0.65 (ref. 10). When the 
Stokes number of a pebble (t = t,Qx, where Qx is the local Keplerian frequency) 
is near unity, this capture radius can be 10’ times larger than the physical cross- 
section alone (nR2, where R, is the radius of the embryo) for Earth-sized planets in 
the region inhabited by the giant planets. 

This process can cause an isolated 1,000 km object to grow into a 10 Mg core in 
only a few thousand years’’"*. To zeroth order, the accretion rate of an embryo 
growing by pebble accretion is dM,/dt = f,. R, where f,, is the fraction of pebbles 
that drift into an embryo’s orbit that will be accreted (also called the filtering 
factor, which is a function of o,., and the spatial distribution of pebbles), 
R=2KrZpebVrad is the rate at which pebbles are fed to the embryo, 5}, is the 
surface density of pebbles at heliocentric distance r, and V;aq is the radial velocity of 
the pebbles due to aerodynamic drag. Pebbles drift at a velocity 


Ve (5) 


Vrad = — P+1 
where 7 is a dimensionless parameter related to the pressure gradient of the gas 
disk*? and is of the order of a few times 10°, and 1; is the local circular velocity of 
the embryo. 

For t ~ 1, a pebble can spiral through the disk in only a few hundred years. 
These large values of v,.q can lead to huge accretion rates and giant-planet cores 
can potentially grow quickly. 

Pebble formation model. In this work we utilize a simple prescription to convert 
dust into pebbles over time. We assume that the gas disk with mass M, exponen- 
tially decays over a timescale f,, so that 


; M, 
M,=—- 8 (6) 
8 
Pebbles (with total mass M,.,) are formed at a rate proportional to the square of 
the dust mass (M4) such that 
Mpeb = kM3 (7) 


where k determines the rate of pebble formation. The dust in the disk will be lost as 
the gas disk evolves and as pebbles form, yielding a dust evolution of 


M 


; . Ma - 
Ma=M,—* —Mpeb (8) 
5 Mg P 
This leads to a total rate of pebble production of 
: K 1 1 2 
Mpel M. 9 
et eK? (um —«/G+ 3) 4 


where k = kt,M4,. Motivated by observations of disks, we take the time constant 
for pebble production to be large, with a median production timescale near 1 Myr. 
We assume that all of the pebbles are produced in 3 Myr. For simplicity, we assume 
that pebbles are randomly created throughout the disk according to the surface 
density, and the size of the pebbles is determined by the assumed t, which is 
constant in each simulation, but varies between calculations. 

To test the robustness of our result to the specific assumptions of our pebble 
formation model, we modified it in two different ways for the runs shown in 
Extended Data Tables 1 and 2. In some runs we allow the pebbles to grow as they 
drift inwards by sweeping up dust. Their growth rate is thus 


dm 


ae = Pat oer Veel (10) 


where pq is the dust mass (which is depleted as pebbles are formed, as dust is swept 
up, and as the gaseous disk evolves), rp is the size of the pebble, and v,. is the 
relative velocity between the pebble and the dust (which is assumed to be perfectly 
coupled to the gas). 

In other runs, we assume a model of pebble formation consistent with the inside 
out manner suggested by ref. 15. We used the implementation as described in 
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ref. 12 and had a wave of pebble formation which moves outwards as a function of 
time. In particular, the radius at which pebbles are generated (rg) at time t is 


1/3 
ra(t)= (=) (GM,)'7(egZo 72 (11) 
where M, is the mass of the Sun, Zp is the metallicity of the disk and ¢g encapsulates 
the efficiency of particle growth. We modify ¢q to adjust the timescale of pebble 
formation. We found that so long as the coagulation coefficient is small enough 
that the timescale for pebble formation is long, we can get results that are generally 
similar to our fiducial model in which pebbles form randomly. 
Model parameters and simulation statistics. Since we are interested in a simu- 
lation building the gas-giant planets, we study the growth of planetesimals spread 
from 4 to 15 Au. As we integrate the simulation forwards, pebbles are formed 
between 4 and 30 au. Each system was evolved for 10 Myr using LIPAD. Extended 
Data Tables 1 and 2 list the 42 simulations that we have completed in our invest- 
igation of giant-planet formation. The eight important parameters that were var- 
ied are as follows. 
(1) The surface density of the gas disk at 1 AU, Xo. In all our simulations we used a 
surface density distribution of =X ofau (ref. 33), where ray is the heliocentric 
distance in AU. For the fiducial simulation, Xo = 7,200 g cm”, which is 4 times the 
surface density of the minimum mass solar nebula at that location*’. The gas 
surface density decreases exponentially with a timescale of 2 Myr (refs 3, 35). 
(2) We employ a flaring gas disk with a scale height h = 0.047 rj, Au. We used two 
values for q: 1.25 from ref. 34 and 9/7 from ref. 36. The latter was used in our 
fiducial run. 
(3) The size of the largest planetesimal, s,.,,. We draw the initial planetesimals 
from a distribution of radii, s, of the form dN/dsocs~*5 such that s is between 
100 km (ref. 37) and s,,ax- For our fiducial simulation s,,., = 1,350 km, which is 
slightly larger than Pluto. 
(4) The fraction of solids in the disk that are initially converted into planetesimals, 
fp For all simulations, we assume a solid-to-gas ratio of 0.01, which includes the 
contribution of water ice**, and f,) of the solids are in planetesimals while (1 — f,1) 
are in pebbles. For the fiducial simulation, f,; = 0.1, which, if extended to 30 AU, is 
consistent with the mass of planetesimals needed to subsequently deliver the giant 
planets to their current orbits”. 
(5) The initial Stokes number of a pebble, t. Note that as pebbles spiral towards the 
Sun, their Stokes number changes (generally decreases) even when it is assumed 
that they do not grow. For our fiducial simulation t = 0.6. 
(6) The strength of inclination and eccentricity damping due to the gravitational 
interaction with the gas disk, c.. We employ the techniques in ref. 5, which are 
based on ref. 25, and allow for both radial migration and inclination/eccentricity 
damping. The strength of both processes can be adjusted by varying two dimen- 
sionless parameters: c, and c,, respectively. All our simulations have c, = 0, mean- 
ing there is no type I migration. For our fiducial simulation c. = 0.66. 
(7) LIPAD~ has the option of allowing pebbles to grow by the accretion of dust 
particles suspended in the gas. The growth rates are calculated using the particle- 
in-a-box approximation assuming the distribution of dust follows that of the gas, 
as described above. Our fiducial simulation has this option disabled. 
(8) The median time of pebble generation, tgen: For most of our simulations, pebble 
generation follows the evolution of the gas disk. In this case tgen ~ 0.7 Myr. 
However, in two cases we decreased f,., in order to determine whether it affects 
our results. We found that the value of ten is not important as long as it is larger 
than the viscous stirring timescale of the embryos. Times in Extended Data Tables 
1 and 2 marked with an asterisk indicate that pebbles were generated in an inside- 
out manner as described above. 

Additionally, all the disks are turbulent with « = 4 X 10-3 (ref. 39). 

The last five columns in Extended Data Tables 1 and 2 present the basic char- 
acteristics of our systems. In particular, N“** is the total number of gas-giant planets 
that formed during each run. It ranges from 0 to 4. Occasionally, a giant planet or 
two was lost from the system. Although some of these were ejected owing to 
dynamical instabilities, the majority were pushed off the inner edge of our computa- 
tional domain by their neighbours as the latter accreted gas. Unfortunately, the inner 
edge of the domain needs to be relatively large because it determines the timestep 
used in the calculation. The N-body integrator requires that the perihelion passage be 
temporally resolved, and since the location of the domain’s inner boundary deter- 
mines the smallest perihelion, it also sets the timestep. In order for the calculations to 
be practical, we were forced to remove any object with a perihelion distance smaller 
than 2.7 au (the location of the snow line). The majority of the lost gas giants would 
probably have remained in the asteroid belt or migrated into the terrestrial planet 
region if we had been able to keep them in the calculations. The column labelled 
N¢*S gives the number of gas giants remaining in the system at 10 Myr. 

Columns (11) and (12) in Extended Data Tables 1 and 2 refer to icy objects with 
masses greater than 1 Mg. Again, N““ and Ne are the total number of planets that 


formed and the number remaining at 10 Myr, respectively. Unlike their larger 
siblings, the majority of the lost icy planets were ejected from the planetary system, 
mainly by encounters with growing gas giants. The number of lost ice planets 
illustrates the fact that the system becomes violent after the largest cores begin to 
accrete gas. The last column shows whether the 5 planetesimals we placed on 
circular co-planar orbits between 20 and 30 au (as proxies for the planetesimal 
disk needed for later planet migration”’) survived at 10 Myr. 

There are a couple of important caveats that must be taken into account to 
properly interpret the results in Extended Data Tables 1 and 2. First, the parameter 
space for our calculations is 8-dimensional, and each calculation required many 
weeks of CPU time. This limited the total number that could be performed. Thus, 
it was impossible to cover parameter space in a meaningful way. Instead, we 
surgically approached the problem by choosing a couple of starting locations 
and varying individual parameters to test their effects. We then moved in a par- 
ticular direction in parameter space if we believed we would be more likely to 
produce systems with some desired characteristic. As a result, the distribution of 
our results seen in Extended Data Tables 1 and 2 (for example the number of gas 
giants) cannot be interpreted as the true distribution that would result if parameter 
space were uniformly covered. 

Our second caveat is the crude methods we used to model the direct accretion of 
gas onto the cores to form gas giants. As discussed above, our simple model for 
calculating the onset of gas accretion and the final mass of planets is probably 
missing important physics. These limitations mean that, although our calculations 
show that pebble accretion produces the correct number of giant-planet cores on 
the correct timescales, the details of the evolution of our systems after the gas giants 
form should be viewed with scepticism. 

Generating Fig. 3. The data in Fig. 3 were generated using the following proced- 
ure. We first needed to construct a high-resolution distribution of pebbles. This 
was accomplished by summing the output steps of our integration over all time. 
Then at each time plotted, we noted the mass and orbit of each embryo. We then 
generated 1,000 clones of each embryo—each with the same semi-major axis, 
eccentricity, and inclination, but with different orbital angles. For each clone, 
we calculated its velocity with respect to the pebbles and then o,,., from equation 
(4). From this, we estimate the instantaneous accretion rate using the particle-in-a- 
box approximation, M.= POpebVrel Where p is the local mass density of pebbles. 
The dots in the figure show the average of the 1,000 clones of the instantaneous 
accretion rates for each embryo. 

Code availability. The calculations presented in this paper were performed using 
LIPAD, a proprietary software product funded by the Southwest Research 
Institute that is not publicly available. It is based upon the N-body integrator 
SyMBA, which is publicly available at http://www.boulder.swri.edu/swifter/. 
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Extended Data Table 1 | Our completed simulations. 


Table 1: Our completed simulations. 
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(1) = (2) (3) (4), (5) (Bs () (8) | (9) (0) (11) (12) (13) 
Xo Gd Saw Fei 7 G “pebbles: te, | NV Ge NV es nie N¥ Disk ok? 
grow? 

(g/cm?) (km) (Y/N) (Myr) (Y/N) 
3600 9/7 1500 02 03 1 N 0.7 4 2 12 «0 a, 
3600 9/7 1500 0.2 03 1 N 0.5 4 a 14 0 x 
3600 9/7 1500 0.2 03 1 N 0.2 4 a 15 1 N 
3600 9/7 1500 0.2 04 1 N 0.7 3 1 9 2 x 
3600 1.25 1500 0.2 3 = #41 Y 0.7 3 2 9 0 ¥. 
4050 9/7 1500 0.18 0.4 1 N 0.7 4 2 6 1 ed 
4050 9/7 1500 0.18 0.5 1 N 0.7 3 3 it 0 ¥ 
4050 9/7 1500 0.18 0.6 1 N 0.7 4 4 9 0 Ed 
4500 9/7 1500 0.16 0.3 1 N 0.7 4 3 Ia xy 
4500 9/7 1500 0.16 0.4 1 N Of. 3 2 7 1 1 
4500 9/7 1500 0.16 0.5 1 N 0.7 3 3 13 (OO N 
4500 9/7 1500 0.16 0.5 0.33 N 0.7 4 2 4 0 Y 
ADO 8/7 1500- 0.16. 0.5.0.1 N 0.7 0 0 7 6 Y 
4500 9/7 1500 0.16 0.6 1 N 0.7 2 2 1? 38 N 
4500 9/7 1500 0.16 0.7 1 N 0.7 0 0 ig ‘G 5 
4500 9/7 1500 0.16 0.9 1 N 0.7 1 0 1 1 Bs 
5400 1.25 1500 015 3 1 * 0.7 0 0 5 4 B 
5400 1.25 1500 0.15 3 1 ~ 0.7 0 0 6 6 Bg 
5400 1.25 1500 0.15 3 0.66 x 0.7 0 0 5 5 Y 
5400 1.25 1500 0.15 3 0.33 pa 0.7 1 1 4 4 Y 
7200 9/7 1350 0.25 06 1 N 0.7 4 4 10 2 yy 
7200 9/7 1350 0.15 06 1 N 0.7 4 4 8 0 N 
7200 1.25 1500 01 1 1 x¥ 0.7 2 2 11 4 y 
7200 1.25 1500 0.1 1 0.25 Y 0.7 3 3 5 1 ¥ 
7200 1.25 1500 01 1 = O Y 0.7 2 2 i 0 x. 
7200 9/7 1350 01 O06 1 N OF 2 Z 8 2 ¥ 
7200 9/7 1350 0.1 0.6 0.66 N 0.7 2 2 9 3 ¥ 
7200 9/7 1350 0.1 0.6 0.66 N 7 3 3 5 1 x 
7200 9/7 1350 0.05 06 1 N 0.7 2 2 2 1 ¥ 
7200 9/7 1350 0.025 0.6 1 N 0.7 3 3 13 OO N 


See Method 


s for a full description of the columns in this table. 
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Extended Data Table 2 | More simulations. 


Table 2: More Simulations. 


(1) (2) (3) ©) (5) ©) ) (8) 7 (9) (0) (11) (12) (13) 
Xo Gd Swae ft T ce pebbles. tay iV oes Ne Nie N¥e Disk ok? 
grow? 

(g/cm?) (km) (Y/N) (Myr) (Y/N) 
900 9/7 1500 0.8 0.1 0.66 N 0.9* 0 0 21 615 » 
900 9/7 1500 0.8 0.1 0.66 N La 0 0 17 «14 Y 
1800 9/7 1500 0.4 0.1 0.66 N 0.9* 5 4 0 N 
1800 9/7 1500 0.4 0.1 0.66 N 1,3 3 3 0 N 
3600 9/7 1500 0.2 0.1 0.66 N 0.9* 7 5 I -Q N 
3600 9/7 1500 0.2 0.1 0.66 N 1.3” 6 4 9 0 N 
3600 9/7 1500 0.2 0.1 0.66 N 0.4* 4 3 9 0 N 
3600 9/7 1500 0.2 0.6 0.66 N 0.9* 5 3 6 0 N 
3600 9/7 1500 0.2 0.6 0.66 N 0.4* 2 2 ll 4 Y¥ 
7200 9/7 1500 0.1 0.6 0.66 N 0.4* 3 il 101 N 
7200 9/7 1500 0.1 0.6 0.66 N 0.9* 4 a 6 0 N 
7200 9/7 1500 0.1 0.6 0.66 N 13 5 | 5 1 N 


This is a continuation of Extended Data Table 1. See Methods for a full description of the columns in this table. 
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Measurement-based control of a mechanical 
oscillator at its thermal decoherence rate 


D. J. Wilson’, V. Sudhir’, N. Piro', R. Schilling’, A. Ghadimi! & T. J. Kippenberg' 


In real-time quantum feedback protocols'’, the record of a con- 
tinuous measurement is used to stabilize a desired quantum state. 
Recent years have seen successful applications of these protocols in 
a variety of well-isolated micro-systems, including microwave 
photons’ and superconducting qubits*. However, stabilizing the 
quantum state of a tangibly massive object, such as a mechanical 
oscillator, remains very challenging: the main obstacle is envir- 
onmental decoherence, which places stringent requirements on 
the timescale in which the state must be measured. Here we describe 
a position sensor that is capable of resolving the zero-point motion 
of a solid-state, 4.3-megahertz nanomechanical oscillator in the 
timescale of its thermal decoherence, a basic requirement for real- 
time (Markovian) quantum feedback control tasks, such as ground- 
state preparation. The sensor is based on evanescent optomechanical 
coupling to a high-Q microcavity’, and achieves an imprecision four 
orders of magnitude below that at the standard quantum limit for 
a weak continuous position measurement°—a 100-fold improve- 
ment over previous reports’ °—while maintaining an imprecision- 
back-action product that is within a factor of five of the Heisenberg 
uncertainty limit. As a demonstration of its utility, we use the 
measurement as an error signal with which to feedback cool the 
oscillator. Using radiation pressure as an actuator, the oscillator is 
cold damped” with high efficiency: from a cryogenic-bath temper- 
ature of 4.4 kelvin to an effective value of 1.1+0.1 millikelvin, 
corresponding to a mean phonon number of 5.3 + 0.6 (that is, a 
ground-state probability of 16 per cent). Our results set a new bench- 
mark for the performance of a linear position sensor, and signal 
the emergence of mechanical oscillators as practical subjects for 
measurement-based quantum control. 

Feedback control of mechanical oscillation is found in many appli- 
cations, ranging from stabilizing the approximately-kilogram test 
mass in a gravity-wave interferometer’ to trapping/cooling of 
atomic’* and sub-atomic particles’. A basic approach to feedback 
control uses a sensor to track the position of the oscillator and an 
actuator to convert the measurement record into a continuous and 
prompt (‘real-time’) feedback force. Recently, the quantum limits of 
continuous feedback” have been explored, enabling demonstrations 
such as microwave Fock-state generation’ and persistent Rabi oscilla- 
tions in a superconducting qubit*. These protocols employ a ‘weak 
measurement’ capable of tracking a quantum state as rapidly as it 
decoheres due to measurement back-action™*. For mechanical oscilla- 
tors, ideal weak position measurements® have been available since the 
advent of the laser, in the context of shot-noise-limited interfero- 
metry’*. Only recently, however, with the confluence of low-loss, cryo- 
genic micromechanics and on-chip, integrated photonics (as explored 
in contemporary cavity optomechanics’®), has it been feasible to con- 
sider their application to quantum feedback protocols'”'*. The main 
challenge is thermal noise, which places stringent requirements on the 
timescale in which the measurement must take place. 

Feedback cooling’*!*’”?”° is a well-studied control protocol that 
illustrates both the utility and the challenge of quantum feedback 
applied to mechanical systems. In feedback cooling protocols, a mech- 


anical oscillator undergoing thermal Brownian motion is steered 
towards its ground state by minimizing a measurement of its displace- 
ment, S, (here expressed as a power spectral density® evaluated at the 
mechanical oscillator frequency, 2). A conventional strategy’ is to 
apply a feedback force proportional to the time derivative of the mea- 
surement (which estimates the velocity of the oscillator). The resultant 
damping acts to reduce (cool) the motion of the oscillator until it 
coincides with the measurement imprecision, Sp. Cooling the oscil- 
lator until it spends most of the time in its ground state 
(amean phonon occupancy of ny, < 1) is possible if Si™? remains lower 
than the zero-point fluctuations of the damped oscillator, that is, if 
SimP < §’P /ny, (see Supplementary Information), where S”? is the spec- 
tral density of intrinsic (absent feedback) zero-point fluctuations and 
Mp, is the phonon occupancy of the thermal bath. (Ss. =4x? /T is 
proportional to the ratio of the ground-state variance te =h/ (2 2mQm) 
and the intrinsic mechanical damping rate I\,, where ne is the reduced 
Planck’s constant and m is the effective mass of the oscillator.) In the 
frequency domain, this condition on S'™? amounts to resolving the 
intrinsic thermal displacement S,~2n,S‘?P with a signal-to-noise 
ratio of S,/S'™? > 2nj,; in the time domain, it corresponds to resolving 
the zero-point motion at a characteristic ‘measurement’ rate® 


2 
T = Xap > Pn 
meas = imp ~ 
2Sx 8 


(1) 


where J, ~ [Mth is the thermal decoherence rate of the oscillator. 
Equation (1) implies the ability to resolve a displacement of x,, in the 
timescale over which a single phonon enters from the thermal bath, 
T7;,'. Satisfying this requirement is a technically daunting challenge 
because of the small x,, and large I‘), of typical engineered mechanical 
oscillators. As a consequence, despite the success of autonomous feed- 
back*!”’, ground-state cooling using measurement-based feedback 
has yet to be demonstrated. 

An additional, fundamental caveat compounds the challenge of 
ground-state cooling and hints at the underlying virtue of quantum 
feedback: Heisenberg’s uncertainty principle predicts that a weak 
(Tneas K Qym) continuous position measurement with an imprecision 
of S’P /2 will produce a stochastic ‘back-action’ force that disturbs the 
position of the oscillator by at least the same amount’. By inference, 
a thermal-phonon-equivalent imprecision of nimp =S."? /(2S:) 
results in an effective increase of the thermal bath occupation by 

ba = 1/(16"imp) (see Supplementary Information). This penalty 
would appear to prohibit ground-state cooling, as it entails substan- 
tially heating the oscillator to achieve the necessary measurement 
precision. Remarkably, however, feedback counteracts back-action™*, 
so that a phonon occupancy of Mm ~ 2\/Nimp (Mba + Nth) — 1/2 <1 (see 
Supplementary Information) can still be achieved'’”’. The limiting 
case of n,, > 0 is approached when the measurement record is domi- 
nated by back-action-induced fluctuations. This occurs when the mea- 
surement is maximally efficient", that is, when the measurement rate, 
Imeas = Im/(16Nimp), approaches the effective thermal decoherence 
rate, Tot ~ (Mn+ Mpa)L im To meet this condition for a typical 
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engineered oscillator, a linear position sensor must achieve an impre- 
cision far (about ny times) below the natural scale set by the ‘standard 
quantum limit’ (SQL)° (timp = Mba = 1/4), while maintaining back 
action near the uncertainty limit: 4, /MimpMba = 1. 

Coupling micromechanical oscillators to optical cavities has 
emerged as a promising way to meet the above requirements. 
Transduction in such ‘cavity-optomechanical’ systems’® arises from 
a parametric coupling, G = 0w,/0x, between the position of the oscil- 
lator and the resonance frequency @, of a cavity. For broadband sens- 
ing, characterized by a cavity decay rate K >> Qy, a resonant laser field 
passing through the cavity acquires a phase shift of 2G8x/x; this can be 
resolved in a homodyne interferometer with a quantum-noise-limited 
imprecision of S'™P = (8G?n,y/K) ~1 where n, is the mean intracavity 
photon number and 7 € [0,1] is the effective photon collection effi- 
ciency (see Supplementary Information). The associated measurement 
rate is I meas = 4g" nn/K =I mConcn, where go = Gx,p is the vacuum 
optomechanical coupling rate and Cp = 4g5 /(K Im) is the ‘single-photon 
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cooperativity’'®, which characterizes the per-photon measurement rate. 


To achieve efficient measurements, contemporary cavity-optomechani- 
cal systems combine the state-of-the-art in high-Q nanomechanics and 
microphotonics’®. As a consequence, imprecision below that at the 
SQL’”, as well as quantum back-action (that is, radiation pressure shot 
noise’’)”*”* has recently been observed. In none of these experiments, 
however, was I reas ~ J tn demonstrated at the photodetector, owing to a 
combination of large thermal occupation, extraneous imprecision, 
optical loss and dynamic instabilities. (An electromechanical device 
operating deep in the “good-cavity’ limit, « < Qy, has recently achieved 
Imeas > In (ref. 22); however, the use of a far-off-resonant probe 
resulted in strong dynamic coupling of the optical and mechanical 
mode.) 

Our system addresses these challenges using near-field optomecha- 
nical coupling®, a paradigm for combining mechanical and optical 
resonators with differing material and geometry. Building on earlier 
work’”’, we couple a mechanical oscillator with an exceptionally 
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Figure 1 | Measuring and controlling the position of a nanomechanical 
beam using a near-field optomechanical transducer. a, Whispering gallery 
modes of a SiO, microdisk are excited using a tapered optical fibre driven by a 
pair of tunable diode lasers (sensor and feedback). Displacement of a Siz3N4 
nanobeam, which samples the evanescent mode volume of the microdisk, is 
recorded in the phase of the transmitted sensor field using a balanced 
homodyne detector (LO, local oscillator field). Radiation-pressure feedback is 
applied by modulating the amplitude of the feedback laser with an 
electronically processed (delayed, bandpass-filtered and amplified) copy of the 
homodyne photocurrent (AM, amplitude modulation). Sample and fibre are 
both embedded in a *He cryostat operating at 4.4K. b, Above, finite element 
model of the optical mode (colour scale shows normalized field amplitude), 
shown in vertical cross-section. Wedged disk and beam (separated by 50 nm) 
are outlined in white. Optomechanical coupling is proportional to the intensity 
gradient at the position of the beam. Below, scanning electron micrograph of 
the device. False-colouring indicates material: blue, SiOz (microdisk); red, Si3N4 
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(nanobeam); grey, Si (wafer substrate). c, Thermal displacement of the 
fundamental beam mode, expressed as a power spectral density of cavity 
frequency noise, S,,(Q) (where Q is the angular Fourier frequency). Various 
measurement strengths (different n,) are shown. In units scaled to the zero 
point spectral density, S?, the peak and background correspond to noise 
quanta nor and nimp, respectively (see text). d, Broadband (shot-noise- 
subtracted) homodyne signal expressed as apparent cavity-frequency noise; the 
spectral density is divided by 27 because « is an angular frequency. Solid 

red line corresponds to measurement data; dashed blue, green, and black lines 
correspond to estimated contributions from nanobeam thermomechanical, 
microdisk thermomechanical, and microdisk thermorefractive noise, 
respectively. Coloured bands denote the imprecision required for Iineas = Ith 
at different bath temperatures T, as labelled: 5)? = g?HQm/(2kgT), assuming 
% 1/Q and Qn = 7.6 X 10°. Noise peaks arising from microdisk and 
nanobeam thermal motion are labelled by numerical models of the 
corresponding mode shapes (not to scale). 
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high Q/(mass) ratio and low optical absorption—a high-stress Si;N4 
nanomechanical beam—to an optical cavity with a high Q/(mode 
volume) ratio and low optical nonlinearity—a SiO, microdisk—by 
localizing a portion of the beam within the evanescent volume of one 
of the whispering gallery modes of the microdisk (Fig. 1b). The system 
is integrated onto a silicon chip, allowing for robust cryogenic opera- 
tion. In contrast to earlier work, we employ a fabrication technique (see 
Supplementary Information) that substantially reduces the distance 
between the mechanical and optical element, enabling separation of 
the SisN4 and SiO, surfaces by 10— 100 nm. In conjunction with cryo- 
genic operation, we realize an enhancement of Cy by nearly five orders 
of magnitude compared to the device reported in ref. 27, while reducing 
the thermal bath occupation by two orders of magnitude. 

We study a system consisting of a 65 um X 400 nm X 70 nm (effec- 
tive mass m ~ 2.9pg) nanobeam placed approximately 50 nm from 
the surface of a microdisk with a diameter of 30 lum. The microdisk is 
optically probed using a low-loss (approximately 6%) fibre taper 
that is pumped by a tunable diode laser. Displacement of the nano- 
beam is observed in the phase of the transmitted cavity field using a 
balanced homodyne interferometer. We interrogate two optical 
modes: a ‘sensor’ mode (used for readout) at 1, ~ 775 nm that exhibits 
an intrinsic decay rate of Ko~ 2m X0.44GHz and a ‘feedback’ 
mode (used for radiation pressure actuation) at A.~843nm for 
which Kj ~ 2m X 1GHz. For the mechanical oscillator, we use the 
Q,, ~ 21 X 4.3 MHz fundamental out-of-plane mode of the nano- 
beam. The vacuum optomechanical coupling rate between the oscil- 
lator and the sensor mode is go ~ 2m X 20 kHz, which corresponds 
to a ‘frequency pulling factor’ of G~2n X 0.70GHznm ' for the 
estimated zero-point amplitude of x,,, ~ 29 fm. Our experiments are 
conducted in a *He buffer gas cryostat*? at a bath temperature of 
T44K (ny ~kpT/(hQm) ~2.1 x 10*) and at gas pressures below 
10 * mbar. Ringdown measurements reveal a mechanical damping 
rate of [\,~2n%5.7Hz (corresponding to a quality factor 
Qin = Qml Pm ~ 7.6 X 10°). Our system thus allows for a near-unity 
single-photon cooperativity of Co ~ 0.64. 

All position sensors are limited by extraneous thermal noise. In 
optical-cavity-based sensors, major sources include thermomechani- 
cal and thermorefractive fluctuations of the cavity substrate*. These 
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fluctuations result in excess cavity-frequency noise, So, and limit 


the measurement rate to 
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where Sj," is the photocurrent shot noise referred to an appar- 
. ; hot _ cimp,shot 

ent cavity resonance-frequency noise, n;,,=S;;?°"° /2S? and 


= Sa? /2S2. Figure 1d shows the extraneous-noise floor of 


our sensor over a broad range of frequencies. We obtained this 
spectrum by subtracting shot noise from a measurement made with 
a large intracavity photon number, n,> 10°. (To mitigate dynamic 
instabilities, the measurement was conducted using approximately 
10mbar of gas pressure at an elevated temperature of 15.7 K.) 
High- and low-Q noise peaks correspond to thermal motion of 
the nanobeam and the microdisk, respectively. Near the fun- 
damental noise peak, we observe an extraneous frequency noise of 


gimp.ex = 


oP = (20 x 30 Hz/ Viz)’, corresponding to a displacement impre- 
cision of SP ~ (4.3 x 107!” m/WHz) * We identify this imprecision 
as a combination of thermorefractive noise®, diode laser frequency 
noise’® and off-resonant thermal motion of the in-plane nanobeam 
mode at 4.6 MHz. Owing to the large zero-point motion of the oscil- 
lator, S?? = 4g? /Im = (21 x 6.7 kHz/WHz)” (corresponding to S? = 
(0.95 x 10-4 m/WHz)*), the thermal-phonon-equivalent mag- 
nitude of Simp.ex has an exceptionally low value of Yenp ~1.0x 107, 
nearly 44dB below that at the SQL. The associated measurement 
rate, I'm /(16nj;,,)~ 2m x 36 kHz, is equal to the thermal decoher- 
ence rate at an experimentally accessible temperature of 1.3 K, which 
implies that ground-state cooling (Iwmeas > Itn/8) should be access- 
ible at 10K. 

The performance of the sensor is limited by optical loss, photother- 
mal and radiation-pressure instabilities, and extraneous sources of 
measurement back-action. We investigate these constraints by record- 
ing the measurement imprecision, nimp, and the effective bath occu- 
pation, Mot = Mn + Mpa as a function of intracavity photon number, 
and comparing their product to the uncertainty-limited value, 
4, /MimpMot = 1 (Fig. 2). Two considerations are crucial. First, to 
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Figure 2 | Measurement imprecision and back action versus intracavity bath occupation, ny + Mpa + Nimp- Dashed red and blue lines are models of 


photon number. Red, blue and green points correspond to measurements 
of effective bath occupation, mot = Mth + Mpa, Measurement imprecision 
(expressed as an equivalent thermal phonon occupation of the oscillator), timp, 
and effective imprecision—back-action product, 4,/nimp Mot, respectively. Solid 
red, blue and black lines correspond to models of ideal (lossless, quantum- 
limited) back-action, m, = Con, imprecision, imp = 1/(16Con,), and their 
SUM, Mpa + Nimp, Fespectively. Dashed black line represents the ideal apparent 


experimental data, which include contribution from extraneous back action, 
Cy* = 0.56, extraneous imprecision, ni*,, =0.70 x 10— 5 and imperfect detection 
efficiency, € = 0.23, as described in the main text. Dashed green line is a model of 
the effective imprecision—back-action product using equation (2). Insets 
highlight the measurement region and have the same axes labels as the main 
plot. Vertical and horizontal error bars are due to systematic uncertainties in nin 
and Con-, respectively (see Supplementary Information). 
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efficiently collect photons from the cavity, it is necessary to increase the 
taper-cavity coupling rate to Kex Z Ko, thereby increasing the cavity 
decay rate to K = ky + K,,. We operate at a near-critically coupled 
(Kex = Ko) value of «~2n X 0.91GHz, which reduces the single 
photon cooperativity to Co ~ 0.31 in exchange for an output coupling 
efficiency of 4. = (Kk — Ko)/K ~ 0.52. Second, to minimize simp, it is 
necessary to maximize n, while avoiding dynamic instabilities. We 
accomplish this by damping the oscillator using radiation pressure 
feedback. Feedback is performed by modulating the intensity of the 
feedback laser with an electronically amplified and delayed copy of the 
homodyne photocurrent. A feedback phase of —31/2 is chosen by 
tuning the total delay, t~172ns, such that the feedback-induced 
spring effect is minimized’’ (see Supplementary Information). The 
resulting viscous radiation pressure gives rise to a well-known cooling 
effect (‘cold-damping’®”’), which reduces the phonon occupancy of 
the mechanical mode to a mean value of Mm ~ Mott m/(U im + Ip), 
where Jy is the optically induced damping rate. 

Measurements of the thermal motion of the oscillator are shown in 
Fig. 1c. We determine mot and nimp by fitting each noise peak to a 
Lorentzian with a linewidth of Prag = I9y + I + Iba (which includes 
a minor contribution from dynamic back action, [},,; see Supplemen- 
tary Information), a peak amplitude of S,,(Qq) ~ 2Mtotr(M'mn/L et) SP 
and an offset of Sj)? = 2MimpSer . For low input powers, ne<nn/Co, 
we observe that the effective bath occupation is dominated by therma- 
lization to the cryostat, mot Mn, and that imprecision scales as 
Nimp = (16€Con,) * where € ~ 0.23. € represents the ideality of the mea- 
surement, and includes optical losses as well as reduction of optomecha- 
nical transduction due to cavity mode splitting (see Supplementary 
Information). When operating with the higher input power—ultimately 
limited by dynamic instability of higher-order beam modes—the lowest 
imprecision we observe is nimp = 2.7(+0.2) X 10°, which corresponds 
to an imprecision 39.7+0.3dB below that at the SQL (see 
Supplementary Information for the budget of uncertainties). The 
associated measurement rate, J eas = 20 X (13 + 1) kHz, is a factor 
of 9.2 lower than the rate of thermal decoherence to the ambient 4.4 K 
bath, I, = 2m X 120 kHz. Notably, this value is within 15% of the 
requirement for ground-state cooling. 


For large measurement strengths, quantum back-action” should in 
principle exceed the ambient thermal force, scaling as n,, = Con, (see 
Supplementary Information). As shown in Fig. 2 (red data), our system 
deviates from this ideal behaviour, owing to excess back-action. The 
back-action manifests as an extraneous cooperativity, C>*~0.56, 
and limits the fractional contribution of quantum back-action to 
Co/ (Co + C&) ~ 35%. Similar behaviour for high-order mechanical 
modes suggests that photothermal heating is the cause of this excess 
heating, as does our observation that C>* is markedly higher at lower 
cryostat temperatures, which is consistent with the reduction of ther- 
mal conductivity in amorphous glasses below 10 K (see Supplementary 
Information). Including extraneous back-action, we model the effec- 
tive imprecision—back-action product (green curve in Fig. 2) as 
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where n*=(16ECon%*,)~’ is the photon number for which 


imp 


shot __ eX Using n.~5 X 10¢<«n*, we observe a minimum value 


imp imp’ 
of 4, /Himp Mot ~ 5.0, which corresponds to a measurement efficiency 
of Dmneas! Trot ~ 0.040. 

To illustrate the utility of this high efficiency, we increase the 
strength of the feedback used to cold-damp the oscillator in Fig. Ic. 
The limits of cold-damping are well-studied’*’”*°. Ignoring back- 
action from the weakly driven (n, < 100) feedback optical mode, the 
phonon occupancy of the cooled mechanical mode depends on the 
balance between its coupling to thermal, measurement and feedback 
reservoirs at rates Iy,, ImMpa and gpl mMimp, respectively, where 
Sm =LMp/Im is the open loop feedback gain (see Supplementary 
Information): 


n n 
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T 
(here we assume I"p, < Isp and Mot >>1). The photon occupancy ny 
is minimized for an optimal gain of gp = \/Mtor/ Nimp» Which corre- 
sponds to suppressing S, to Si™P (black curve in the inset of Fig. 3). 
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Figure 3 | Radiation-pressure feedback cooling to near the ground state. 
Red and blue points correspond to measurements of the phonon occupancy 
of the mechanical mode, n,, (plus a phonon-equivalent zero-point 

energy of 1/2), and its component due to feedback of measurement noise, 
Mm, fo = Nimp, Sa /(1+gm), respectively, versus measured damping rate, 

Tepe = (1 + gay) Im. Red, blue and black dashed lines correspond to models 
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of the components in equation (3): Mtot/(1 + gay); Mm,» and Nm + 1/2, respect- 
ively, using experimental parameters I), = 2% X 5.7 Hz, myot = 2.4 X 10° and 
Nimp = 2.9 X 10 *. Inset, thermomechanical noise spectra for various feedback 
gain settings (represented by the differently coloured curves); Lorentzian fits 
to these spectra were used to infer the red and blue points. Error bars are due 
to systematic uncertainty in /,o:(see Supplementary Information). 
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Notably, for Co =0, mm <1 requires imp < 1/(2n,). The results pre- 
sented in Fig. 2 suggest that n,, ~ 2 should be achievable. 

Figure 3 shows the results of feedback cooling using a measurement 
with an imprecision far below that at the SQL. We emphasize that 
for this demonstration, imprecision was deliberately limited to 
Nimp ~ 2.9 X 10“ to reduce uncertainties due to extraneous heating 
and the off-resonant tail of the thermal noise peak at 4.6 MHz. (The 
latter restriction limits the applicability of equation (3) to effective 
damping rates of I'ee¢ =(1+ 85) Im S20 x 200 kHz.) Feedback gain 
was controlled by changing the magnitude of the electronic gain, leav- 
ing all other parameters (for example, laser power) fixed. The 
Markovianity condition t<2z/I'n is satisfied by the feedback 
delay'*. By fitting the closed-loop noise spectrum (Fig. 3, inset) to a 
standard Lorentzian noise-squashing model’* (see Supplementary 
Information), we estimate the phonon occupancy of the mechanical 
mode using Mm +0.5~ Tet (So(Qnm) +S") /(2SP), where So? 
denotes the off-resonant background. After accounting for extraneous 
back-action, we infer a minimum occupation of n,, ~ 5.3 + 0.6 (see 
Supplementary Information for budget of uncertainties) at an optimal 
damping rate of Ig ~ 2m X 52 kHz, which corresponds to a ground- 
state probability of 1/(1 + n,,) ~ 16%. This value agrees well with the 
prediction based on equation (3) and the data shown in Fig. 2. Notably, 
for larger feedback strengths, shot-noise ‘squashing’ leads to an appar- 
ent reduction of S,, even though n,, physically increases. This discrep- 
ancy can be resolved with an out-of-loop measurement”, at the cost of 
reduced measurement efficiency. 

Collectively, our results establish new benchmarks for the linear 
measurement and control of a mechanical oscillator. Using an opto- 
mechanical sensor with a readout imprecision that is nearly 40 dB 
below that at the SQL, we have shown that traditional radiation pres- 
sure cold-damping”’ can be used to cool a nanomechanical oscillator 
to a mean phonon occupancy of approximately 5.3; this represents 
a 40-fold improvement over previous reports'’”*”, and invites 
comparison” to the recent success of autonomous feedback (side- 
band) cooling”’”’. Looking forward, high-efficiency optomechanical 
sensors may enable a variety of feedback applications such as back- 
action evasion”™* and mechanical squeezing”. 
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Switching on elusive organometallic mechanisms 


with photoredox catalysis 


Jack A. Terrett!, James D. Cuthbertson!, Valerie W. Shurtleff! & David W. C. MacMillan! 


Transition-metal-catalysed cross-coupling reactions have become 
one of the most used carbon-carbon and carbon-heteroatom 
bond-forming reactions in chemical synthesis. Recently, nickel 
catalysis has been shown to participate in a wide variety of C-C 
bond-forming reactions, most notably Negishi, Suzuki-Miyaura, 
Stille, Kumada and Hiyama couplings'”. Despite the tremendous 
advances in C—C fragment couplings, the ability to forge C—O 
bonds in a general fashion via nickel catalysis has been largely 
unsuccessful. The challenge for nickel-mediated alcohol couplings 
has been the mechanistic requirement for the critical C-O bond- 
forming step (formally known as the reductive elimination step) to 
occur via a Ni(111) alkoxide intermediate. Here we demonstrate that 
visible-light-excited photoredox catalysts can modulate the pre- 
ferred oxidation states of nickel alkoxides in an operative catalytic 
cycle, thereby providing transient access to Ni(II) species that 
readily participate in reductive elimination. Using this synergistic 
merger of photoredox and nickel catalysis, we have developed a 
highly efficient and general carbon-oxygen coupling reaction 
using abundant alcohols and aryl bromides. More notably, we have 
developed a general strategy to ‘switch on’ important yet elusive 
organometallic mechanisms via oxidation state modulations using 
only weak light and single-electron-transfer catalysts. 

Visible-light-mediated photoredox catalysis has gained momentum 
over the past decade as a platform for the development of novel syn- 
thetic transformations via the implementation of non-traditional 
open-shell mechanisms. This catalysis field employs transition metal 
polypyridyl complexes or organic dyes that, upon excitation by visible 
light, readily engage in an array of single-electron transfer (SET) pro- 
cesses* that often include oxidation, reduction or redox pathways 
that have previously been elusive*’. Indeed, several research groups 
have demonstrated that photoredox catalysis can be combined with 
transition metal catalysis to achieve a series of unique bond-forming 
reactions that take advantage of the known reactivity of each individual 
mode of catalysis'°'°. We recently questioned whether the combina- 
tion of photoredox catalysis and transition metal catalysis might be 
capable of delivering fundamentally new organometallic reactivity by 
providing access to currently unknown or inaccessible mechanistic 
pathways. 

Ina series of seminal studies, Hillhouse and colleagues demonstrated 
that Ni(11) alkoxide complexes do not readily undergo reductive elim- 
ination at ambient or elevated temperature and that stoichiometric 
conversion to a less stable Ni(im) system is required for productive 
C-O bond formation’””””. Similarly, C-heteroatom reductive elimina- 
tion has been demonstrated when stoichiometrically accessing Ni(Iv) 
intermediates”’. In fact, computational studies have revealed that 
Ni(i1) alkoxide reductive elimination is endothermic, in contrast to 
Pd, Pt or alkyl Ni(m) variants, which are exothermic (Fig. 1)”. 
Indeed, the use of palladium and copper catalysis to generate aryl ethers 
from alcohols is well precedented and broadly employed”*”’. The util- 
ization of nickel catalysis in C-O bond construction would be an 
important complementary method, considering the diversity of elec- 
trophiles amenable to nickel cross-couplings, such as alkyl halides 


and aryl pseudohalides'’. Hartwig and colleagues have reported a 
Ni(COD)>-catalysed aryl etherification at elevated temperature using 
three preformed oxides, namely sodium methoxide, ethoxide and 
tert-butyldimethylsiloxide with electron-deficient aryl bromides”. 
However, a general nickel-catalysed C-O coupling strategy under 
mild conditions has thus far been elusive and no examples with alco- 
hols have been documented. We recently questioned whether photo- 
redox catalysis might be used to modulate the range of nickel alkoxide 
oxidation states that can be accessed during a conventional catalytic 
cycle. More specifically, we recognized that it should be possible to 
utilize the photonic energy of weak visible light to expand thermo- 
dynamically the number of nickel-catalyst oxidation states via two 
discrete photocatalyst SET events: first, oxidation of Ni(m) to the 
elusive Ni(1)-alkoxide complex (which we assumed would facilitate 
the critical C-O bond-forming step); and second, reduction of Ni(1) to 
Ni(0), to enable the subsequent aryl bromide oxidative addition steps. 
If successful, we recognized that this new synergistic catalysis pathway 
might provide (1) the first general example of a nickel-catalysed C—O 
coupling reaction employing simple alcohols and aryl halides, and 
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Figure 1 | Modulating oxidation states of nickel enables challenging 
carbon-heteroatom coupling. Pd(11)-catalysed C-O reductive elimination is 
an exothermic process and well precedented. Ni(m1) C-O reductive elimination 
is thermodynamically disfavoured—we postulated that accessing Ni(1m) by 
photoredox-mediated oxidation state manipulation could switch on C-O 
coupling in a general fashion, circumventing this thermodynamic restriction. 
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(2) a demonstration of a new strategy by which elusive organometallic 
couplings can be ‘switched on’ via oxidation state modulation. 

A detailed description of our proposed mechanistic cycle for the 
photoredox/nickel-catalysed C—O coupling is outlined in Fig. 2a. 
Oxidative addition of Ni(0) (species 1) into an aryl bromide would 
deliver a Ni(1) aryl complex 2 (ref. 27). At this stage, ligand exchange 
(displacement of the bromide ion with the substrate alcohol) would 
produce the Ni(m) aryl alkoxide (3)—an organometallic species 
that traditionally represents a catalytic “dead end’. At the same time, 
visible light irradiation of heteroleptic iridium(i) photocatalyst 
Ir[dF(CF3)ppy]2(dtbbpy)PF, [dF(CF3)ppy = 2-(2,4-difluorophenyl)-5- 
(trifluoromethyl)pyridine, dtbbpy = 4,4’-di-tert-bu BL 2,2'-bipyridine] 
(4) would produce the long-lived photoexcited *Ir™!' state 5 (excited- 
state lifetime t = 2.3 [Us) (ref. 28). At this juncture, we hypothesized that 
the nickel and photoredox cycles would merge via SET between the 


a Modulating nickel oxidation states with photoredox catalysis 
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Figure 2 | Proposed mechanism by which photoredox catalysis switches on 
challenging nickel-catalysed C-O coupling. a, The catalytic cycle begins 
with oxidative addition of Ni(0) (species 1) into an aryl bromide to give Ni(m) 
aryl 2. Ligand exchange with an alcohol under basic conditions produces Ni(1!) 
aryl alkoxide 3. Excitation of photocatalyst 4 gives the excited state 5, 

which can then oxidize 3 to the key Ni(ur) intermediate 6 and generate 

Ir(1) 7. Nim) 6 readily reductively eliminates to form the aryl ether product 
and Ni(1) 8. A second SET event closes the two catalytic cycles, regenerating 
Ni(0) 1 and Ir(m) 4. b, Nickel-catalysed C-O reductive elimination can be 
turned on by addition of a photoredox catalyst and visible light. 
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Ni(11) complex 3 and the highly oxidizing photoexcited *Ir(m) 5 to 
generate the critical Ni(1m) aryl alkoxide 6 and the reduced Ir(m) 
photocatalyst 7. On the basis of the established redox potentials of 
these catalytic intermediates, we envisioned this key electron transfer 
step to be kinetically and thermodynamically favourable under stand: 
ard reaction conditions (half-reaction reduction potential E, ie 
[*Ic"/Ir"] = +1.21 V versus the saturated calomel electrode (SCE) 
in CH;CN, E12" [Ni”/Ni"] = +0.71 V versus Ag/AgCl in CH,Cl, 
for (bpy)Ni"(Mes)OMe)”*””. Once formed, we assumed the transient 
Ni(u1) complex 6 would rapidly undergo reductive elimination to 
forge the desired C—O bond, while delivering the aryl ether product 
and Ni(1) complex 8. At this stage, we envisioned both catalytic cycles 
merging for a second time, enabling the single-electron reduction 
of Ni(1) to Ni(0) by Ir(1) species 7 (Ey2°°4 (ir Ir] = —1.37 V versus 
SCE in CH3CN), thereby completing the nickel and photoredox cata- 
lytic cycles at the same moment”. 

On the basis of this synergistic catalysis design plan, we began our 
investigations into the proposed nickel-catalysed aryl—alcohol ether- 
ification using 1-hexanol and 4-bromoacetophenone. As expected, the 
use of traditional nickel cross-coupling conditions (that is, without 
photoredox), led to no observable product despite the implementation 
of a range of Ni(ml) and Ni(0) complexes (Fig. 2b, equation (1)). By 
contrast, we found that the desired C—O bond could be forged in 
excellent yield (86%) at room temperature via the introduction of 
the photocatalyst Ir[dF(CF3)ppy]2(dtbbpy)PF, in the presence of 
Ni(COD),, dtbbpy, quinuclidine, K,CO; and blue light-emitting 
diodes (LEDs) as the visible light source. Given the inherent advan- 
tages of employing a bench-stable Ni(u) catalyst in lieu of Ni(COD),, 
we hypothesized that a catalytically active Ni(0) species 1 might be 
accessible in situ via two SET reductions of (dtbbpy)Ni(m)Cl, using the 
iridium photocatal st (Fi a fr /tr"] = —1.37V versus SCE in 
CH3CN, EF; a [Ni"/Ni°’] = —1.2 V versus SCE in N,N-dimethylfor- 
mamide (DMF))***°. In this vein, we hypothesized that quinuclidine 
may also serve as a sacrificial reductant and were pleased to observe 
that with NiCl,(dtbbpy) the desired fragment etherification was 
accomplished in 91% yield (Fig. 2b, equation (2)). 

With the optimal conditions in hand, we next sought to explore the 
scope of the aryl bromide component in this new nickel-catalysed 
C—O coupling reaction. As shown in Fig. 3, a diverse array of 
electron-deficient bromoarenes with a variety of functional groups 
(ketones, trifluoromethyls, nitriles, sulfones, esters) perform well using 
this synergistic protocol (compounds 9-14, 88-96% yield). Moreover, 
3-bromoanisole was found to be a competent substrate in this trans- 
formation, demonstrating the diversity of substituents that can be 
tolerated (26, 80% yield). Notably, bicyclic aromatics such as phthali- 
mides and phthalides couple with high levels of efficiency (15 and 16, 
92% and 80% yield, respectively). With respect to heteroaromatic 
coupling partners, we have found that a range of pyridines with sub- 
stitution at the 2 and 3 positions (nitrile, trifluoromethyl, methyl) are 
effective electrophiles in this protocol (17-20, 67-91% yield). 
Furthermore, quinolines, azaindoles and pyrimidines can be employed 
without loss in efficiency (21-23, 60-88% yield). Perhaps most impor- 
tantly, this transformation is not limited to electron-deficient aryl 
bromides. For example, phenyl and tert-butylphenyl can be readily 
incorporated in this nickel etherification (24 and 25, 68% and 77% 
yield, respectively). It should be noted that comparable yields of aryl 
ether product 13 were achieved when either coupling partner was 
employed in excess (1.5 equiv.) (see Supplementary Information). 

We next turned our attention to the alcohol reaction component. As 
shown in Fig. 3, we discovered that a host of primary alcohols were 
effective coupling partners, including substrates that incorporate 
benzyl, tert-butyl, cyclopropyl and alkene functionalities (27—30, 
77—89% yield). Moreover, carbamates and ester groups are tolerated, 
as exemplified by the etherification of a serine derivative in 78% 
yield (31). Interestingly, trifluoroethanol couples proficiently in this 
reaction despite its diminished nucleophilicity (32, 77% yield). Both 
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Figure 3 | Alcohol and aryl halide scope in the photoredox-nickel-catalysed 
C-O coupling reaction. A broad range of aryl bromides and alcohols are 
efficiently coupled to produce aryl ethers under the standard reaction 
conditions (top, generalized reaction). The aryl bromide scope includes 
electron-deficient and electron-neutral arenes and heteroarenes with diverse 


methanol and d,-methanol are effective substrates, providing a valu- 
able strategy for installing H3CO- and D3CO- groups on arenes (33 
and 34, 87% and 81% yield, respectively). 

Relatively complex substrates are also tolerated in this transforma- 
tion, as demonstrated by the etherification of a protected pyranose 
in excellent yield (35, 82% yield). Notably, secondary alcohols are 
equally effective in this etherification protocol—alkyl, benzyl and pro- 
tected ether functionalities all being amenable to this technology 
(37—40, 79-86% yield). Moreover, secondary alcohols possessing 
B-quaternary carbons could be employed with good levels of efficiency 
(41, 74% yield). In the case of a substrate possessing both a primary 
and secondary alcohol, the less hindered site is arylated predomi- 
nantly, as demonstrated by product 36 (71% yield, 6:1 regiomeric ratio 
(rr)). This nickel-catalysed process can be applied to the expedient 
synthesis of medicinally relevant molecules, such as the antidepressant 
fluoxetine (Prozac), in three chemical steps (42, 82% yield for coupling 
step). It is important to note that using H2O as the nucleophile delivers 
the corresponding phenol product in a single step from commercially 
available materials (43 and 44, 65% and 62% yield, respectively). 

On the basis of our proposed design plan (Fig. 2a), we initiated 
mechanistic studies to determine if the critical Ni(1m) oxidation state 
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functionalities. Both primary and secondary alcohols are proficient coupling 
partners under the standard conditions. Water can be employed as the 
nucleophile to generate phenol derivatives in a single step. Isolated yields 
are indicated below each entry. See Supplementary Information for 
experimental details. 


was indeed operative in this new photoredox protocol. As shown in 
Fig. 4a, we preformed the stable Ni(11)(aryl)(alkoxide) complex 45 to 
examine whether C—O reductive elimination might be achieved under 
various photoredox or non-photoredox conditions. Importantly, in 
the absence of either light or photocatalyst, none of the desired aryl 
ether product 46 was observed. However, when complex 45 was 
exposed to Ir[dF(CF3)ppy]2(dtbbpy)PF, (4) and a visible light source, 
the desired ether product 46 was formed in 59% yield. We attribute this 
result to the formation of a transient Ni(im) complex 47 via the single- 
electron reduction of *Ir'", thereby enabling the favourable Ni(m) 
C—O reductive elimination, a result that is consistent with the 
seminal studies of Hillhouse. Furthermore, cyclic voltammetry of com- 
plex 45 showed an irreversible oxidation at +0.83 V versus SCE in 
CH3CN (Fig. 4b), which we attribute to the oxidation of Ni(II) to Ni(m) 
(45 to 47). As such, we expect the oxidation of Ni(11) aryl alkoxides 
such as 3 to be readily accomplished by *Ir' photocatalyst 5 (E,/."* 
(tT ytrt] = +1.21 V versus SCE)". Finally, Stern-Volmer fluor- 
escence quenching experiments have demonstrated that the emission 
intensity of *Ir'" 5 is diminished in the presence of Ni(1) complex 45, 
presumably signifying an oxidation event to form Ni(II) 47 (see 
Supplementary Fig. 7). 
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Figure 4 | Mechanistic studies support the intermediacy of transient Ni(1) 
complex to enable C-O reductive elimination. a, Reductive elimination 
to form C-O bond only occurs in the presence of photocatalyst and light. 
Reactions performed on 5.55 jtmol scale with 41 mol% photocatalyst 1 and blue 


A further series of experiments were performed using Ni(COD), as 
the nickel catalyst in lieu of NiCl, (see Supplementary Fig. 1). In the 
presence of this Ni(0) catalyst, the reaction proceeds with yields that 
are comparable to the Ni(m) precatalyst; however, when the iridium 
photocatalyst is omitted, the desired C—O coupling is not observed. 
This result suggests again that reductive elimination does not occur 
from the Ni(1) oxidation state as a single-electron oxidation to Ni(1m) is 
required to enable product formation. Interestingly, when quinucli- 
dine is omitted, the reaction efficiency is greatly diminished (8% yield 
over 24h). This effect can possibly be attributed to the amine function- 
ing as an electron shuttle, facilitating reduction of Ni(1) and oxidation 
of Ni(11); however, mechanistic studies are currently ongoing to elu- 
cidate the role of quinuclidine in more detail. 

We have developed a catalytic strategy for accessing transient 
Ni(im) complexes through the use of visible-light-mediated photore- 
dox catalysis for application in various challenging C—O cross- 
couplings. This method of modulating transition metal oxidation 
states has enabled a previously elusive transformation within the realm 
of nickel catalysis. We anticipate that this new mechanistic paradigm 
will find application in using nickel and other transition metals in a 
series of challenging bond constructions. 
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Reduced carbon emission estimates from fossil fuel 
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Nearly three-quarters of the growth in global carbon emissions 
from the burning of fossil fuels and cement production between 
2010 and 2012 occurred in China’. Yet estimates of Chinese emis- 
sions remain subject to large uncertainty; inventories of China’s 
total fossil fuel carbon emissions in 2008 differ by 0.3 gigatonnes of 
carbon, or 15 per cent’**. The primary sources of this uncertainty 
are conflicting estimates of energy consumption and emission 
factors, the latter being uncertain because of very few actual mea- 
surements representative of the mix of Chinese fuels. Here we 
re-evaluate China’s carbon emissions using updated and harmo- 
nized energy consumption and clinker production data and two 
new and comprehensive sets of measured emission factors for 
Chinese coal. We find that total energy consumption in China 
was 10 per cent higher in 2000-2012 than the value reported by 
China’s national statistics®, that emission factors for Chinese coal 
are on average 40 per cent lower than the default values recom- 
mended by the Intergovernmental Panel on Climate Change’, and 
that emissions from China’s cement production are 45 per cent less 
than recent estimates’*. Altogether, our revised estimate of China’s 
CO, emissions from fossil fuel combustion and cement production 
is 2.49 gigatonnes of carbon (2 standard deviations = +7.3 per 
cent) in 2013, which is 14 per cent lower than the emissions 
reported by other prominent inventories’**. Over the full period 
2000 to 2013, our revised estimates are 2.9 gigatonnes of carbon 
less than previous estimates of China’s cumulative carbon emis- 
sions’*. Our findings suggest that overestimation of China’s emis- 
sions in 2000-2013 may be larger than China’s estimated total 
forest sink in 1990-2007 (2.66 gigatonnes of carbon)’ or China’s 
land carbon sink in 2000-2009 (2.6 gigatonnes of carbon)’. 
Reports of national carbon emissions”''"* are based on activity data 
(that is, amounts of fuels burned) and emission factors (that is, amount 
of carbon oxidized per unit of fuel consumed), with these factors esti- 
mated as the product of the net carbon content (that is, tonnes carbon 
per joule), net heating value (that is, joules per tonne fuel), total carbon 
content (that is, tonnes carbon per tonne fuel) and oxidation rate (that 
is, carbon oxidized per carbon content; see Methods). The uncertainty 
of China’s emissions estimates is typically reported as +5% to +10% 


19,20,21 
’ 


(refs 4, 13, 15), but this range is somewhat arbitrary because neither the 
activity data nor the accuracy of emission factors is well known. For 
instance, national activity data are substantially different from the sum 
of provincial activity data’®, and the emissions factors used are not 
based on up-to-date measurements of the fuels actually being burned 
in China, of which the quality and mix are known to vary widely from 
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Figure 1 | Total carbon content and production of coal mines. The inset 
shows the comparison between carbon content from 602 coal samples and 
4,243 coal mines (R = 0.59, P< 0.001, n = 104). Each dot in the inset indicates 
the average of carbon content from 602 coal samples and 4,243 coal mines 

in the same 1° by 1° grid. The nearly one-to-one correlation indicates that 
samples and mines capture the same spatial variability of coal carbon content 
across China. 
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year to year, especially for coal’’. Indeed, using different official sources 
of activity data and emissions factors can result in estimated emissions 
that vary by up to 40% in a given year (see Methods). 

Here, we present revised estimates of Chinese carbon emissions 
from the burning of fossil fuels and cement production during the 
period 1950-2013 using independently assessed activity data and 
two sets of comprehensive new measurements of emission factors. 
Results suggest that Chinese CO, emissions have been substantially 
overestimated in recent years: 14% less than the estimates by the 
Emissions Database for Global Atmospheric Research (EDGAR) ver- 
sion 4.2 (EDGAR being adopted by the Intergovernmental Panel on 
Climate Change (IPCC) as the emission baseline) in 2013 and 12% less 
than the latest inventory China reported to the United Nations 
Framework Convention on Climate Change (UNFCCC; in 2005). 
The difference is due primarily to the emission factors used to estimate 
emissions from coal combustion; our measurements indicate that the 
factors applicable to Chinese coal are on average about 40% lower than 
the default values recommended by the IPCC”" and used by previous 
emissions inventories'*"*. 

In re-evaluating Chinese energy consumption, we adopt the ‘appar- 
ent consumption’ approach’*"*, which does not depend upon energy 
consumption data (that previous studies have shown to be not very 
reliable’*”’). Instead, apparent energy consumption is calculated from 
a mass balance of domestic fuel production, international trade, inter- 
national fuelling, and changes in stocks, data about which are less sub- 
ject to ‘adjustment’ by reporting bodies and accounting errors related to 
either energy consumed during fuel processing or assumptions about 
the mix of fuel types (especially coal) being used by individual consu- 
mers. Furthermore, this approach allows imported and domestically 
produced fuels to be tracked separately so that appropriate emission 
factors can be applied to these fuels (see Methods). 

Apparent consumption of coal, oil and natural gas in China in 2013 
was 3.84 Gt, 401.16 Mt and 131.30 Gm’, respectively. Between 1997 
and 2012, we estimate that cumulative energy consumption was 10% 
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Figure 2 | Histograms of Chinese coal properties. a, b, Total carbon content 
of 4,243 coal mines (a) and 602 coal samples (b). Dashed lines show mean, and 
shading indicates 90% and 95% intervals. c, d, Net carbon content (c) and 

net heating values (d) of the 602 coal samples. Carbon content for coal mines 
(a) and samples (b) are significantly lower than IPCC values, which is mainly 
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greater than the national statistics and 4% lower than provincial stat- 
istics (Extended Data Fig. 2). In addition, our results indicate a higher 
annual growth rate of energy consumption than national statistics 
between 2000 and 2010 (9.9% yr * instead of 8.8% yr '); the high 
growth rate is consistent with satellite observations of NO,”*', 
although NO,, to fuel emission factors change with time as well. 

Given the large fraction of CO emissions from coal combustion 
(80% between 2000 and 2013), estimates of total emissions are heavily 
dependent on the emission factors used to assess coal emissions. Thus, 
we re-evaluate each of the variables that determine these emission 
factors. The mean total carbon content of raw coal samples from 
4,243 state-owned Chinese coal mines (4,243 mines represent 36% 
of Chinese coal production in 2011)” (Fig. 1) is 58.45% (Fig. 2a), 
and the production-weighted total carbon content is 53.34%. 

These results straddle the results of an independent set of 602 coal 
samples from the 100 largest coal-mining areas in China (these areas 
together represent 99% of Chinese coal production in 2011)” 
(Extended Data Fig. 3), which reveal a similarly low mean carbon 
content of 55.48% (Fig. 2b) and a production-weighted mean total 
carbon content of 54.21%. The net carbon content per energy pro- 
duced of these same samples is 26.59 tC TJ~', or 26.32 tC TJ ' if 
weighted by production (Fig. 2c), and their net heating value is 
20.95 PJ Mt’, or 20.6 PJ Mt’ if weighted by production (Fig. 2d). 
Although the measured net carbon content per energy produced of 
these samples is within 2% of the IPCC default value (25.8 tC TJ — 1) the 
heating value from these coal samples (20.95 PJ Mt~ ) is significantly 
less than either the IPCC default value (coking coal) of 28.2 PJ Mt ‘or 
the mean value of US coal of 26.81 PJ Mt! (ref. 23). The lower heating 
value of Chinese coal reflects its generally low quality and high ash 
content (Fig. 2e, f). For example, the average ash content of our 602 
coal samples was 26.91% compared to the average ash content of US 
coal, 14.08% (ref. 23), but is consistent with recent studies”. 

Finally, we assessed the oxidation rate (carbon oxidized per carbon 
content) of the fossil fuels consumed by 15 major industry sectors in 
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China with 135 different combustion technologies (see Supplementary 
Data), as analysed by the National Development and Reform 
Commission (NDRC; ref. 25). We calculate a production-weighted 
average oxidation rate for coal of 92%, somewhat lower than the 
IPCC default value of 98%, but generally consistent with China-spe- 
cific values reported by the NDRC (94%)**, China’s National 
Communication that reported to UNFCCC (92%)%, and the estimate 
from a previous study (on average 93%)*°. Our estimates of the oxida- 
tion values of oil and natural gas in China (98% and 99%, respectively) 
are each within 1% of the IPCC default value. 

Combining our revised estimates of carbon content, heating value 
and oxidation value, we derive new emission factors for coal, natural 
gas and oil burned in China. The revised emission factors differ from 
IPCC defaults by —40%, +13% and — 1%, respectively (Fig. 3). In turn, 
applying these lower emission factors to our revised estimates of 
energy consumption, our best estimate of Chinese carbon emissions 
from fossil fuel combustion in 2013 is 2.33 GtC using the carbon 
content of 4,243 coal mine samples, and 2.31 GtC if the carbon content 
of 602 coal samples is used. On the basis of the residual scatter of 
carbon contents from these independent sets of coal samples 
(Fig. 1), the associated 2¢ uncertainty related to coal carbon content 
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Figure 3 | Comparison of emission factors in 2012. IPCC: default value from 
IPCC guidelines for national emission inventories (1996, 2006)”"!. NDRC: 
value reported by the NDRC (ref. 25). NC: China’s National Communication, 
which reported to the UNFCCC (2012 for value in 2005)*. All error bars are 
26 errors. 
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is of the order of 3%. Additional uncertainty about Chinese emissions 
is provided by varying the estimates of coal consumed, by +10% as 
evidenced by the range between national and provincial activity 
data’*’’. Combining these two numbers gives the 7.3% uncertainty 
range of Chinese fossil fuel carbon dioxide emissions. 

Wealso used clinker production data” to recalculate CO emissions 
from cement production (which accounts for roughly 7%-9% of 
China’s total annual emissions in recent years*). This direct method 
avoids use of default clinker-to-cement ratios (for example, 75% and 
95% in the IPCC guidelines”), and results in emissions estimates that 
are 32%-45% lower than previous estimates (0.17 GtC in 2012 com- 
pared to 0.30 GtC reported by the Carbon Dioxide Information 
Analysis Center (CDIAC) and 0.24 GtC by EDGAR; Extended Data 
Fig. 4). The clinker-to-cement ratio calculated by clinker production is 
58%, or ~23% lower than the latest IPCC default values. The new, 
lower estimated cement emissions are consistent with factory-level 
investigations” and several other recent studies”””®. 

Together, our revised estimate of fossil fuel and cement emissions in 
2013 is 2.49 GtC (26 = £7.3%); the new estimate is 12% less than the 
latest inventories China reported to the UNFCCC (1.63 GtC in 2005, 
20 = +8) and 14% less than the estimate by EDGAR version 4.2 (2.84 
GtC in 2013, 26 = +10%) (Fig. 4). By t-test, our revised estimates of 
fossil fuel and cement emissions during 2000-2013 are generally lower 
(at the 90% level) than estimates by EDGAR (P = 0.016) and CDIAC 
(P = 0.077). 

Our new estimate represents a substantial revision of annual global 
carbon emissions, reducing the global emissions in 2013 by 0.35 GtC, 
an amount larger than the reported increase in global emissions 
between 2012 and 2013 (ref. 31). A systematic reduction of fossil fuel 
and cement emissions of 0.35 GtC translates into a 15% smaller land 
sink when this term is calculated as a residual between anthropogenic 
carbon emissions, atmosphere carbon growth and the ocean carbon 
sink*’, and is two times the estimated carbon sink in China’s forests 
(0.18 GtC y'). Thus it implies a considerable revision of the global 
carbon budget*’. Over the full period 2000 to 2013, the downward 
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Figure 4 | Estimates of Chinese CO, emissions 1990-2013. Total carbon 
emissions from combustion of fossil fuels and manufacture of cement in 
China from different sources (International Energy Agency (IEA; sec, sectoral; 
ref, reference)’, Energy Information Administration (EIA; http://www.eia.gov/) 
and BP’ estimates do not include the emissions from cement production). 
The yellow dots are the numbers China reported to the UNFCCC in the 
years 1994 and 2005 (ref. 8). The red-shaded area indicates the 95% 
uncertainty range of carbon emissions calculated by this study, assuming 
the emission factors during the period 1990-2013 are the same as those 
determined for 2012 in this study. 


20 AUGUST 2015 | VOL 524 | NATURE | 337 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


revision of cumulative emissions in China by 2.9 GtC (13%) is larger 
than the cumulative forest sink in 1990-2007 (2.66 GtC)? or China’s 
land carbon sink in 2000-2009 (2.6 GtC)’®. Depending upon how the 
remaining quota of cumulative future carbon emissions is shared 
among nations, a correction of China’s current annual emissions by 
10% suggests a 25% (inertia basis) or 70% (blended basis) difference in 
the cumulative future emissions that can be emitted by China under a 
2°C warming target**. Evaluating progress towards national commit- 
ments to reduce CO, emissions depends upon improving the accuracy 
of annual emissions estimates and reducing related uncertainties”. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Calculation of carbon emissions from fossil fuel combustion and cement 
production. Carbon emissions are calculated by using activity data, which are 
expressed as the amount of fossil fuels in physical units used during a production 
processes (activity dataginker is the amount of clinker produced) multiplied by the 
respective emission factor (EF): 


Emission = activity data x EF (1) 
Emissions from cement manufacturing are estimated as: 
Emissioncement = activity dataginker X EFginker (2) 


If data on sectorial and fuel-specific activity data and EF are available, total emis- 
sion can be calculated by: 


Emission = >. ys ce (activity data; ;, x EFi;4) (3) 


where i is an index for fuel types, j for sectors, and k for technology type. Activity 
data are measured in physical units (tonnes of fuel expressed as t fuel). 

EF can be further separated into net heating value of each fuel v, the energy 
obtained per unit of fuel (TJ per t fuel), carbon content c (tC TJ” fuel) and 
oxidization rate o (in %, the fraction of fuel oxidized during combustion and 
emitted to the atmosphere). The values of v, c and o are specific for fuel type, 
sector and technology: 


Emission = >») e > (activity data; jx X Vij.e X Cie X O1,k) (4) 


For the coal extracted in China (for example, for the 4,243 coal mines analysed in 
this study), net heating v and carbon content c values are not directly available, and 
a more straightforward emission estimate for coal emissions can be obtained using 
the mass carbon content (C,, in tC per t fuel) of fuels, defined by C,, = c X vso that 
the total emission can be calculated as: 


Emission = > > > (activity data; jx X Car, X 0 it) (5) 


Apparent energy consumption calculation. The activity data can be directly 
extracted as the final energy consumption from energy statistics, or estimated 
based on the mass balance of energy, the so-called apparent energy consumption 
estimation: 


Apparent energy consumption = domestic production + imports — exports 
+ change in stocks—non-energy use of fuels 
(6) 
Calculation of carbon emission from cement production. The carbon emission 
from cement production is due to the production of clinker, which is the major 
component of cement. When clinker is produced from raw materials, the calcina- 
tion process of calcium carbonate (CaCO 3) and cement kiln dust (CKD) releases 
CO: CaCO3 — CaO + CO. The amount of emission can be calculated from 
the molar masses of CaO (55.68 g mole ') and carbon (12 g mole ') and the 
proportion of their masses in clinker production. Furthermore, the emission 
associated with CKD that is not recycled to the kiln is calculated using the CKD 
correction factor, CF. gy. 
Carbon emission from cement production can be calculated the by clinker 
emission factor (EFuinker) and clinker production. 


Emissioncement = activity datacinker X EFuinker (7) 
EF dinker = EFCaO x (1+ CF eax) (8) 


EF CaOginker =fraction CaO x (12/55.68) =fraction CaO x 0.2155 (9) 


Fraction CaO is the mass proportion of CaO per unit clinker (in %). EF CaOginker 
is the mass of total carbon emission released as CaO per unit of clinker (unit: tC 
per tclinker). CF.g, is the CKD correction factor (in %). EF uinker is the mass of total 
carbon emission per unit of clinker (tC per t clinker) 

Clinker is the major component of cement. However, data on clinker produc- 
tion is less widely reported than that of cement production. Where data about 
clinker production are not available, the clinker-to-cement ratio Roinker-cement 
(in %) can be used to estimate the cement emission factor (EF ement) and to further 
estimate the emission based on cement production. 


Rainker—cement _ activity dataciinker / activity datacement (10) 
EF ‘cement — Reement—clinker x EF dinker (1 1) 


Emissioncement = EF cement X activity datacement 
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The IPCC default Fraction CaO (clinker) is 64.6%, and the fraction CaO (cement) 
is 63.5%; thus, the IPCC default EF uinker is 0.1384 (tC per t clinker). In the IPCC 
1996 guidelines, the clinker-to-cement ratio is 95%, which assumes that most 
cement is Portland cement and that the corresponding default EFcement is 
0.1360 (tC per t clinker). In the IPCC 2006 guidelines, the clinker-to-cement ratio 
is 75% when no direct clinker production data are available, and the corresponding 
default EF cement is 0.1065 (tC per t clinker). In this study, the clinker-to-cement 
ratio is calculated using clinker production statistics and cement production stat- 
istics. The cement production and clinker production statistics are listed in 
Supplementary Information. 

It should be noted that the non-energy use of fossil fuels and other industrial 

processes such as ammonia production, lime production and steel production will 
also produce carbon emissions. To be consistent with the scope of the international 
data set we are comparing, those emissions are not included in this study. On the 
basis of a previous study the total emissions of these non-energy fuel use and 
industry processes was equivalent to 1.2% of China’s emissions from fossil com- 
bustion in 2008 (ref. 14). 
The uncertainty range of China’s emission estimates. We conduct analysis to 
show the uncertainty range of China’s emission estimates based on emission 
factors (EFs) reported in the literature. We collected 12 sets of EF data for fossil 
fuel combustion from the six following official sources: IPCC (1996, 2006)”"’, 
China National Development and Reform Commission (NDRC)**, UN 
Statistics (UN)**, China National Communication on Climate Change (NC)*, 
China National Bureau of Statistics (NBS)? and Multi-resolution Emission 
Inventory for China (MEIC; http://www.meicmodel.org). There are three sets of 
EFs in the NDRC data, corresponding to three tiers of fuel classifications, four sets 
in NC and two sets in UN. We combined these 12 sets of EFs with two sets of 
energy statistics derived from national and provincial data®**. This yielded 24 
possible inventories for China’s carbon emissions of fossil fuel combustion for 
1997-2012 (Extended Data Table 1). The underlying data used in the commonly 
used data sets (IEA, CDIAC, BP, EDGAR) is either listed in this data assembly 
(NBS and IPCC) or not publically available. 

The mean value of 24 possible inventories is 2,523 MtC in 2012, and the 
standard deviation is 376 MtC (15%); the detailed data are listed in the 
Extended Data Table 1. The 2¢ standard deviation range suggested by 24 possible 
inventories is 30%, which is larger than the reported range of 10% by current 
emission data sets such as EDGAR. 

A Monte Carlo (Extended Data Fig. 1) approach was adopted to assess the 
distribution range of the emissions by assuming that all reported EF values have 
the same probability (values have been randomly selected with equal probabilities 
and calculated for 100,000 times). The mean value of the 24 members’ ensemble 
is 2.43 GtC in 2012 (95% confidence interval is +20%, —11% and maximum- 
minimum range of +27%, —15%). The uncertainty is attributed to the activity 
data (about 40% of total uncertainty) and EF (60%). The variability of EF for coal 
dominates the total uncertainty (55% for total uncertainty and 90% for the uncer- 
tainty by EF), whereas the EF for other fuels are more comparable. Different EF 
values for coal mainly reflect variation in v and hence C,, (C,, =v  c) values, 
whereas the variation of c and o are comparatively smaller (less than 10%). 

The distribution range of the emissions is listed in Extended Data Fig. 1. 

We assumed the equal possibility for various EFs when conducting the Monte 

Carlo analysis; this will expand the uncertainty range. However, both the standard 
deviation of 24 possible inventories and the Monte Carlo analysis show a signifi- 
cant uncertainty range, implying the considerable system error of the emission 
estimates by using reported EFs; thus it is critical to perform emission estimates 
based on a measurement-based EF. 
Apparent consumption calculation. We adopted the ‘apparent consumption’ 
approach to recalculate China’s energy consumption. The apparent energy con- 
sumption is the mass balance of fuels produced domestically for energy produc- 
tion, trade, international fuelling and change in stocks: 


Apparent energy consumption = domestic production + imports — exports 


+ change in stocks — non-energy use of fuels 
(13) 


The calculated apparent energy consumption is usually different from the reported 
energy consumption in China. For example, our recalculated energy consumption 
is higher (17% for coal, 2% for oil and 3% for gas) than the national reported energy 
consumption for 2013. 

We believe the resulting estimates of energy consumption to be more accurate 
than both national and provincial energy statistics, because of the following reasons. 

First, national energy statistics may be biased’**”** because of underreported 
fuel use in boilers from small factories and workshops'”*”**. In addition, the 
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adjustment of national statistics by the Chinese government has been discussed in 
the literature?’ 

Second, provincial energy statistics are also not reliable because of the consid- 
erable inconsistencies in provincial aggregated final-consumption energy statist- 
ics. When comparing energy consumption with total available energy supply 
(production plus imports and changes in stocks) in provincial statistics for 
2012, coal and oil show differences of 0.25 Gt coal and 81 Mt oil®”’, respectively. 
In addition, after removing international trade, the amount of exported and 
imported coal within all provinces should be equal to each other, whereas, in fact, 
we found an unexplained mismatch of 0.37 Gt coal in provincial aggregated energy 
statistics, equal to 21% of total domestically traded coal. 

Third, the apparent energy consumption is based on production and trade 
statistics. Chinese data on fuel production and trade statistics are more reliable 
and consistent than data on final energy consumption. After many years of policies 
to reduce or close private coal mines, 97% of the coal production in China (3.40 Gt 
coal in 2011) is from government-owned companies (including central and local 
governments) that keep good records of the mass of coal extracted’. This 
reliability is supported by the fact that national and provincial statistics on coal 
production differed by only 10% in 2012 (refs 6, 22), while the same sources 
reported coal consumption rates that differed by 37% (3.19 Gt for national data 
versus 4.36 Gt for provincial data). Moreover, coal production and trade data are 
consistently released earlier than coal consumption data, suggesting that the pro- 
duction data are the original data and therefore less prone to ‘adjustment’ for 
political or other proposes. Finally, trade data have also been monitored inter- 
nationally, so the numbers can be verified by different nations. 

Fourth, compared with the final energy consumption approach that involves 20 
kinds of primary and secondary energy products, the apparent consumption 
approach is much simpler: it considers only three primary fuel types (raw coal, 
crude oil and natural gas) in order to avoid accounting errors due to energy con- 
sumed during fuel processing (for example, mass loss in coal washing and coking). 

Fifth, the apparent energy consumption approach uses energy production data, 
which avoids having to deal with uncertain estimations of the mix of different coal 
types used by each final consumption category. When considering the variation of 
EFs for different fuel types and sectors, analysis of the sources of uncertainty is 
more complex. It is difficult to assess specific coal-burning EFs for a myriad of 
small consumers, and to scale these data up to the national level. Large energy 
consumers such as power plants continuously mix coal from different sources, 
which also makes it very difficult to assess national consumption-weighted average 
EF (weighted by share of different kinds and quality of coal consumed) from a 
consumption point of view. In contrast, production data can provide the national 
production-weighted average EF, and thus the national consumption-weighted 
average EF can be calculated by excluding the coal used for exports, non-energy 
use and stock changes. 

Sixth, the apparent consumption approach allows us to track imported and 
domestically produced fuels, so that a different EF can be applied. 

Between 1997 and 2012, the calculated apparent energy consumption was 10% 

greater (14% for 2012) than the one reported in national statistics and 4% lower 
than provincial statistics (Extended Data Fig. 2). The growth rate of apparent 
energy consumption is consistent with the growth rate of industrial productions 
(Extended Data Fig. 5). 
Sample selection. China’s coal resources are mainly concentrated in 100 major 
coal mine areas from 24 coal mine bases, and there are about 4,000 stable coal 
mines among these 100 coal mine areas that record coal production. The location 
of coal sampling is consistent with the distribution of coal mines (Extended 
Data Fig. 3). 

By collecting the coal samples, the following principles are adopted. 

First, the sampling spot is based on coal seams under production in one coal 
mine district, because the properties of coal from within a coal seam are almost the 


same. It is guaranteed that at least one sample is collected from each coal seam in 
one coal mine district. 

Second, every coal mine area is sampled, so the 602 samples are across 100 mine 
areas that cover the majority of the nation’s coal production. 

Third, there are at least three samples for each coal mine with a production over 
5 million tonnes. 

Fourth, in the same coal mine district, coal mines with high production are 
selected preferentially. 

Fifth, for sampling within a location, if the samples are collected from a coal pile, 
they should be collected from at least three different coal piles. If the samples are 
collected from a conveyor belt, they should be collected three times with several 
hour intervals from each other. All these three-times-collected samples are merged 
together and considered as one sample data point (in total 602 sample data points) 
for further analysis. All samples are stored in sealed plastic bags for further analysis. 
Sample analysis. For the sample measurements, we measured the air dry mois- 
ture, total moisture, net heating value, and the ash, carbon, hydrogen, nitrogen and 
total sulfur content. Carbon, hydrogen, nitrogen and total sulfur are determined by 
combustion using an Elementar elemental analyser. Coal samples are weighed into 
a tin capsule and burned in a tube furnace at 1,350 °C. Carbon dioxide, water, 
nitrogen dioxide and sulfur oxide are released from the samples and measured by a 
thermal conductive detector (TCD). Two parallel samples were tested together 
each time. The analysis is performed based on ISO standards: 

Measurement process (ISO 18283:2006: Hard coal and coke—Manual sampling). 

Air dry moisture (ISO 11722:2013: Solid mineral fuels. Hard coal. Determi- 
nation of moisture in the general analysis test sample by drying in nitrogen). 

The total moisture (ISO 589:2008: Hard coal. Determination of total moisture). 

Carbon, hydrogen and nitrogen contents (ISO 625:1996: Solid mineral fuels. 
Determination of carbon and hydrogen. Liebig method; ISO 29541:2010: Solid 
mineral fuels. Determination of total carbon, hydrogen and nitrogen content. 
Instrumental method). 

Ash content and volatile matter (ISO 11722:2013: Solid mineral fuels. Hard coal. 
Determination of moisture in the general analysis test sample by drying in nitro- 
gen; ISO 1171:1997: Solid mineral fuels. Determination of ash; and ISO 562:2010: 
Hard coal and coke. Determination of volatile matter). 

The net calorific value (ISO 1928:2009: Solid mineral fuels. Determination of 
gross calorific value by the bomb calorimetric method and calculation of net 
calorific value). 

Total sulfur contents (ISO 334:2013: Solid mineral fuels. Determination of total 
sulphur. Eschka method). 
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Extended Data Figure 1 | Uncertainty distribution of Chinese CO, data are assigned equal probabilities (n = 100,000). Chinese carbon emissions 
emissions 1997-2012. Monte Carlo simulations of the Chinese carbon based on national energy activity data (EN) and provincial activity energy 


emissions based on a blended activity data set where national and provincial _—_ data (EP) in 2012 are shown on the right bar. 
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Extended Data Figure 2 | Total fossil fuel energy consumption based upon national statistics, provincial statistics and calculations in this study. 
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Extended Data Figure 3 | Location of 4,243 coal mines with annual production and 602 coal samples. The coal samples and mines are consistent with spatial 
distribution. 
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Extended Data Figure 4 | Emission estimates of China’s cement production by different sources. 
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Extended Data Figure 5 | Growth rate of carbon emissions, based upon BP, 


Industrial products comprise the production of cement, iron, steel and power 
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Extended Data Table 1 | Twenty-four emission inventories of fossil fuel combustion based on reported emission factors and fuel inventories 


in China. 
Unit: MtC 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 (2008 2009 2010 2011 2012 
IPCC EP 1081 1113 1087 1109 1153 1256 1438 1662 1969 2180 2432 2573 2707 2964 3251 3379 
IPCC EN 1062 1028 1017 1016 1034 1106 1286 1543 1722 1892 2036 2220 2332 2442 2700 2783 
NBS EP 837 862 846 868 902 983 1127 1303 1548 1716 1912 2021 2126 2319 2561 2664 
NBS EN 827 805 800 803 819 879 1015 1228 1375 1514 1632 1760 1847 1933 2144 2213 
NDRC tier 1 EN 840 817 813 816 832 891 1037 1251 1400 1542 1662 1802 1893 1981 2196 2266 
NDRC tier 1 EP 851 877 861 881 918 1000 1148 1330 1580 1753 1958 2070 2181 2392 2626 2737 
NDRC tier 2 EP 841 868 849 869 903 984 1128 1305 1550 1718 1918 2030 2139 2346 2573 2685 
NDRC tier 2 EN 829 805 800 802 817 874 1016 1225 1370 1506 1622 1762 1850 1934 2143 2212 
NDRC tier 3 EP 850 879 858 879 910 995 1137 1319 1555 1723 1924 2037 2137 2340 2564 2675 
NDRC tier 3 EN 837 816 806 808 821 880 1019 1236 1370 1505 1620 1763 1843 1926 2133 2201 
NC 1994 EN 768 750 746 751 766 822 953 1151 1294 1427 1538 1656 1737 1819 2017 2084 
NC 1994 EP 774 798 784 805 838 913 1049 1213 1446 1603 1789 1892 1992 2178 2403 2504 
NC 2005 tier 1 EN 819 797 790 791 805 860 999 1206 1348 1481 1594 1734 1821 1903 2107 2174 
NC 2005 tier 1 EP 831 858 840 859 892 970 1111 1283 1529 1694 1891 2003 2111 2313 2535 2646 
NC 2005 tier 2 EN 816 791 784 783 796 849 985 1189 1330 1459 1570 1711 1796 1876 2075 2141 
NC 2005 tier 2 EP 828 856 836 854 885 962 1100 1270 1513 1675 1870 1981 2090 2288 2505 2615 
NC 2005 tier 3 EN 817 794 785 787 799 856 991 1199 1331 1462 1573 1713 1791 1872 2072 2138 
NC 2005 tier 3 EP 829 859 837 857 887 969 1106 1280 1513 1675 1870 1981 2080 2278 2495 2604 
MEIC EN 792 770 765 768 783 838 968 1171 1309 1440 1550 1672 1754 1834 2032 2099 
MEIC EP 802 829 810 832 862 938 1074 1241 1475 1633 1817 1923 2024 2204 2437 2537 
UN average EN 1103. 1063 1051 1048 1064 1134 1319 1575 1737 1895 2031 2258 2372 2490 2740 2818 
UN average EP 1126 1160 1134 1152 1200 1305 1487 1725 2042 2257 2518 2663 2804 3108 3353 3483 
UN China EN 843 814 808 809 822 876 1014 1215 1342 1464 1567 1747 1836 1929 2124 2186 
UN China EP 856 883 867 881 921 1001 1140 1328 1577 1745 1949 2061 2175 2430 2596 2707 
Standard deviation 105 109 105 107 113 126 142 155 196 219 260 265 280 339 357 376 
Average Value 869 870 857 868 893 964 1110 1310 1509 1665 1827 1960 2060 2212 2432 2523 
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Highly saline fluids from a subducting slab 
as the source for fluid-rich diamonds 


Yaakov Weiss', John McNeill’, D. Graham Pearson’, Geoff M. Nowell? & Chris J. Ottley” 


The infiltration of fluids into continental lithospheric mantle is a 
key mechanism for controlling abrupt changes in the chemical and 
physical properties of the lithospheric root’, as well as diamond 
formation’, yet the origin and composition of the fluids involved 
are still poorly constrained. Such fluids are trapped within dia- 
monds when they form*’ and so diamonds provide a unique 
means of directly characterizing the fluids that percolate through 
the deep continental lithospheric mantle. Here we show a clear 
chemical evolutionary trend, identifying saline fluids as parental 
to silicic and carbonatitic deep mantle melts, in diamonds from the 
Northwest Territories, Canada. Fluid-rock interaction along with 
in situ melting cause compositional transitions, as the saline fluids 
traverse mixed peridotite-eclogite lithosphere. Moreover, the 
chemistry of the parental saline fluids—especially their strontium 
isotopic compositions—and the timing of host diamond formation 
suggest that a subducting Mesozoic plate under western North 
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Figure 1 | Microinclusion compositions in fibrous diamonds from the 
Fox kimberlite, Ekati mine. a, Photomicrograph of diamond E191 with the 
location of the microinclusions analysed by electron probe micro-analyser 
(EPMA). Filled symbols indicate HDFs; open symbols indicate olivine and 
orthopyroxene (OPX). b, Composition of HDFs and micro-mineral 
inclusions associated with specific Fox diamonds coded by colour. The global 
compositional range of HDFs (delineated by average compositions for 
individual diamonds) and the wide range of compositions shown by individual 
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America is the source of the fluids. Our results imply a strong 
association between subduction, mantle metasomatism and 
fluid-rich diamond formation, emphasizing the importance of 
subduction-derived fluids in affecting the composition of the deep 
lithospheric mantle. 

Ancient sections of continental lithospheric mantle (CLM) are char- 
acterized by multi-stage evolution, involving strong depletion and melt 
removal followed by variable degrees of ephemeral refertilization’”. 
Refertilization, or enrichment, occurs by mantle metasomatism, 
whereby invading fluids or melts transport mobile components 
between different mantle reservoirs. This process plays a major part 
in shaping the mineralogical and geochemical variation in the CLM, as 
well as in determining its long-term stability, rheology and oxidation 
state’*. While many mantle samples reflect the action of metasoma- 
tism, including mantle xenoliths and mineral inclusions in diamonds, 
the nature of the fluids involved can normally only be constrained 


105 Saline HDFs i 


PM-normalized 


0.01 + 


: 
TSU eS 
BSEPXESISESSETN 


0.001 


Cs 
Nb 
Eu 
Ti 
Gd 
Tb 
Dy 
né 
Ho 
Er 
Tm 
Yb 
Lu 


diamonds PAN4 (ref. 5) and ON-DVK-294 (ref. 6) from neighbouring central 
Slave kimberlites are also shown. Shaded arrows define the compositional 
evolution trajectories of HDFs due to fluid-rock interaction and melting in 
carbonated peridotite (taupe arrow) and hydrous eclogite (pink arrow) 
lithologies (see also Fig. 2). c, Primitive-mantle (PM) normalized trace element 
and chondrite-normalized (CN) REE patterns of saline and silicic HDFs in 
fibrous diamond from the Fox kimberlite. Full analyses and additional figures 
are in Supplementary Tables 1-4 and Supplementary Fig. 1. 
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Figure 2 | MgO and SiO, versus Cl content of HDF microinclusions in 
fibrous diamonds from the central Slave craton. The complete data set shows 
clear evolution trajectories (shaded arrows) from a parental saline fluid to high- 
Mg carbonatitic and silicic compositions, formed due to wall rock reaction 
and local melting induced in peridotite and eclogite, respectively. If carbonate 
is present in the eclogite, increasing reaction could lead to the formation of 
low-Mg carbonatitic HDFs (dashed black arrow); however, this trend has not 
yet been constrained by mineral microinclusions paragenesis. Data points 
for saline HDFs are from this study and refs 5, 6; silicic HDFs from this study; 
saline to silicic from ref. 5; saline to high-Mg carbonatitic and silicic to low-Mg 
carbonatitic compositions from ref. 6. Calculated compositions are assumed to 
be free of H,O and CO, and are in weight per cent (wt%). 


indirectly from geochemical proxies or calculated using mineral/melt 
partition coefficients. Carbonatitic fluid or silicic melts have been pro- 
posed as the key metasomatic agents”’®. Direct samples of mantle 
metasomatic fluids are encased as microinclusions in fast-growing 
diamonds, known as ‘fibrous diamonds’ (Fig. 1a). These high-density 
fluid (HDF) inclusions are shielded from late-stage alteration, encap- 
sulating a unique chemical and physical record that can trace the 
sources of deep mantle fluids and constrain the processes that shape 
their nature. Their study has revealed that, along with carbonatitic and 
silicic melts, saline compositions, with very high Cl, K, Na and H,O 
contents®®, are involved in the metasomatic alteration affecting the 
deepest parts of the CLM. 

Diamond HDFs vary between four major compositional types: sal- 
ine, silicic, and high-Mg and low-Mg carbonatitic’ (Fig. 1b). A strong 
connection was established between high-Mg carbonatitic HDFs anda 
carbonated peridotite source, either lithospheric or asthenospheric in 
origin’’""”, while silicic and low-Mg carbonatitic HDFs have been 
related to hydrous eclogite (plus or minus carbonate)’. The saline fluid 
endmember sampled by diamonds is more enigmatic and its source in 
the deep lithosphere has remained ambiguous*”’*”’. 

Here we analysed 11 microinclusion-bearing fibrous diamonds 
from the Fox kimberlite, Ekati mine, Northwest Territories, Canada. 
They have either coated or cubic-like morphologies, contain nitrogen 
in A centres (pairs of nitrogen atoms replacing two adjacent carbon 
atoms) and encapsulate a variety of fluid compositions plus inclusions 
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Figure 3 | Trace-element ratios and Sr isotopic signature in HDFs from 
the central Slave craton. a, Relationship between Eu* (expressed here as 
Eu/Sm) and Sr* (Sr//(Pr X Nd)) anomalies in the saline HDFs constrain the 
subducted endmember, least influenced by interaction with lithosphere wall 
rock. b, Eu* versus La/Pr ratios. c, Eu* versus (°’Sr/**Sr); (corrected to 
kimberlite eruption age of 55 Ma). The positive trend formed by saline fluids 
varies between the Sr isotopic signature of sea water 150-200 Ma (ref. 26) and 
°’sr/°°Sr measured in megacrystalline clinopyroxene (CPX) from the Diavik 
CLM”. Higher (°’Sr/**Sr), values in silicic HDFs are probably inherited from 
old phlogopite in the eclogitic lithology within the Slave CLM. HDF 
composition symbols as in Fig. 2, large symbols are data from the present study, 
where each colour represents an individual diamond; small symbols are data 
from refs 5, 6, 12 and 14. 


of their host rocks (Fig. la, b and Extended Data Figs 1, 2), which 
shows a strong association between fluid composition and mantle host 
lithology. The majority of diamonds (9 of 11) contain saline HDFs 
solely associated with peridotite on the basis of their microinclusions 
of olivine, orthopyroxene, Cr-diopside and chromite. Silicic fluid 
compositions are related exclusively to eclogitic inclusions of ompha- 
citic clinopyroxene. Both saline and silicic HDFs are enriched in 
incompatible elements (Fig. 1c); they have fractionated rare-earth ele- 
ment (REE) patterns with elevated Ba, U, Th and light-REEs but 
depleted Nb, Ta and alkalis (K, Rb and Cs). However, the fractionated 
nature of these patterns and the light-REE/medium-REE and Th/U 
ratios in particular are more pronounced in saline HDFs than silicic 
fluids, indicating different sources. The most striking differences 
between the two HDF compositions are the positive Eu and Sr anom- 
alies within saline fluids versus no Eu anomaly and negative Sr 
anomalies in the silicic fluids. Initial °’Sr/*°Sr ((°’Sr/*°Sr);) values in 
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Figure 4 | Schematic illustrating the evolution of 
saline fluids with increasing fluid-rock 
interaction as they interact with cratonic mantle 
lithosphere. The discovery of fluids carrying 
strong oceanic protolith geochemical signatures 
(that is, positive Eu and Sr anomalies and 8767 /°6Sr 
signature of Mesozoic sea water) in continental 
lithospheric diamonds suggests that the Slave CLM 
was directly overlying the subducting slab at the 
time of Mesozoic metasomatism. Numbers refer to 
stages in the compositional evolution depicted in 
the inset MgO-Cl and SiO,-C] trends. When the 
parental saline fluids (1) ingress into the CLM they 
react and their oceanic signature is diluted. The 
melt-depleted nature of cratonic lithospheric 
peridotite prevents notable melting unless the 
saline fluids traverse either carbonated-peridotite 
or eclogite lenses, leading to in situ formation of 
high-Mg carbonatitic (2) and silicic melts (3), 
respectively. The possible presence of carbonate in 
eclogite may lead to formation of low-Mg 
carbonatite fluids with increasing melting (see 
Fig. 2 and Extended Data Fig. 5 for data). Rapid 
diamond formation occurs due to the oxidation 
gradient between the evolving fluids and local 
lithosphere, either as new fibrous diamonds or as 
fibrous coats on previously formed older 
octahedral diamonds. 
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the saline HDFs are 0.7039-0.7090 compared with 0.7064 and 0.7111 
in the two diamonds with silicic fluids. 

The physical and chemical characteristics of fibrous diamonds and 
their HDFs sampled by the Fox kimberlite are identical to previously 
studied fibrous diamonds from both the neighbouring (8 km north- 
east) Panda kimberlite at Ekati>!* and to those from kimberlites at 
Diavik mine*’* (30 km southeast). Combining these localities reveals 
that the vast majority of these fibrous diamonds (84%) trapped saline 
HDFs, which are strongly associated with peridotite hosts. In a single 
Diavik diamond, the HDFs continuously change from saline to high- 
Mg carbonatitic compositions, from centre to edge (Figs 1b, 2 and 
Extended Data Fig. 3); olivine, chromite and Cr-diopside microinclu- 
sions in this diamond demonstrate its peridotitic association’. A Panda 
diamond containing HDFs falling between saline to silicic composi- 
tions has omphacite microinclusions’, providing strong evidence that 
silicic HDFs may evolve from saline fluids due to wall rock reaction 
with eclogite (Figs 1b and 2). An absence of included minerals pre- 
vented paragenetic typing in only one diamond, from Diavik, contain- 
ing silicic to low-Mg carbonatitic HDFs (Fig. 2). However, the 
continuous global compositional array between silicic to low-Mg 
carbonatitic HDFs, and the similarity of these fluids to the products 
of low-degree partial melting experiments of carbonated eclogite, sug- 
gest a strong genetic link to eclogite’. The relative abundance of HDF 
endmembers in fibrous diamonds, the compositional relationships 
between the HDFs and their co-existing mineral microinclusions, plus 
the observed evolutionary trends from saline HDFs to other composi- 
tional types (Fig. 2), provide a means of tying the various metasomatic 
fluids to a common parental saline fluid endmember. A key issue is 
then the ultimate origin for saline HDFs. 

Positive Eu anomalies in low-pressure Cl -rich hydrothermal fluids 
are typically interpreted to result from plagioclase control during 
fluid-rock interaction at high temperatures’’. Such signatures can also 
originate from the strong aqueous complexes formed between dis- 
solved Cl” and Eu’*, compared to other REE ions!°. The lack of clear 
correlation between Cl content and the size of the Eu anomaly in global 
saline HDFs precludes a simple fluid-rock interaction process being 
the sole driver for generating the positive Eu anomalies. In Ekati and 
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Diavik diamonds, the pronounced Eu anomalies of saline HDFs are 
associated with positive Sr anomalies (Figs 1c and 3a), mimicking 
the plagioclase accumulation signature of both oceanic and ophiolitic 
gabbros’”"’. This correlation suggests a low-pressure crustal origin for 
the saline HDF elemental signature, through prograde metamorphic 
reaction of plagioclase to garnet and eclogite formation. Exactly how 
the saline chemistry of these fluids develops within subducting oceanic 
crust is not yet clear. One possibility is that they were originally pore 
fluids trapped in the crust during low-pressure hydrothermal altera- 
tion by sea water. Their initial salinity and K/Na ratios evolve as H,O 
and Na are consumed during spilitization, producing hydrated basalt. 
The highly saline nature of these new solutions potentially prevents 
dehydration during shallow subduction, allowing the formation of 
stable Cl -rich phengite at high pressure and tempreture’””°. If dehyd- 
ration should occur, residual chlorides with high K/Na ratios can be 
subducted and water originating from dehydration of underlying ser- 
pentinized peridotite at 150-200km depth” can regenerate highly 
saline fluids at depth. 

Having established a possible evolutionary link between the spec- 
trum of fluid compositions observed in Slave diamonds and subduc- 
tion-related crustal protoliths, we can deduce the metasomatic history 
of the central Slave lithospheric root leading to fibrous diamond 
formation (Fig. 4). The inherited positive Eu and Sr anomalies in saline 
diamond HDFs suggest direct ingress of fluids into the lithosphere 
from a subducting slab closely underlying the continental root. We 
explain the Eu and Sr anomalies, light-REE/medium-REE enrichment 
levels and variably radiogenic *’Sr/*°Sr in these fluids (Fig. 3a—c) as 
representing the interaction with peridotite in the lithospheric root. 
This interaction altered the elemental chemistry of the invading saline 
fluids, flattening both the Sr and Eu anomalies and lowering the ratios 
of the most incompatible elements. The *’Sr/*°Sr signature of the 
saline HDFs experiencing the most extensive fluid-rock interaction 
were buffered to local peridotite compositions that were relatively low 
(~0.704) (ref. 21). Fluid-rock interaction is also reflected by an 
increase of SiO, MgO and CaO and decrease of Cl and K in the saline 
HDFs (Figs 1b and 2). The refractory nature of cratonic peridotite 
dictated that partial melting during saline fluid infiltration only 
occurred when carbonate metasomes (that is, magnesite) were 
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intersected, leading to the formation of high-Mg carbonatite HDFs 
with low °’Sr/*°Sr (ref. 12). Infiltration of saline fluids into eclogite 
hosts is tracked by the compositional variation in eclogite-related 
HDFs, from saline to highly silicic (Figs 1b and 2), leading to the 
formation of in situ silicic melts. Both of the daughter high-Mg carbo- 
natitic and silicic melts could then crystalize metasomatic phases in 
their host rocks such as the Cl -rich phlogopite and apatite documen- 
ted in an eclogitic xenolith from Diavik’’. Overall, the Slave CLM was 
enriched with K, Cl, Ba and incompatible trace elements by the invad- 
ing saline HDFs, while the oxidation gradient between the evolving 
fluids and the local lithosphere initiated ephemeral redox processes 
leading to diamond formation”. 

The issue is the timing of the fluid metasomatism and the nature of 
the event that triggered the process. The short mantle residence time of 
fibrous diamonds in the central Slave CLM (<200 million years ago 
(Ma); Extended Data Fig. 4), indicated by their low-aggregated nitro- 
gen impurities, translates into young formation ages for both dia- 
monds and their HDFs. Active subduction zones were a key feature 
of the complex tectonic setting of western North America and the high 
Arctic during the Mesozoic era”, providing several options for the fluid 
source in the ideal time window to allow saline HDF generation, 
diamond formation and eruption of the diamonds in Eocene epoch 
kimberlites. The low-angle subduction that has been suggested for 
some of these plates, such as the Farallon slab”’, provides an opportunity 
for the direct transfer of slab-derived fluids into the base of the cratonic 
lithosphere. The most pristine saline HDFs that interacted least with the 
lithospheric root have Sr isotopic signatures corresponding with early 
Jurassic period seawater *’Sr/*°Sr values”*, strengthening the temporal 
connection between subduction and metasomatism (Fig. 3c). The litho- 
sphere beneath western North America was extensively hydrated by 
shallow subduction”””* and mantle xenoliths from the Wyoming craton 
provide direct evidence for chlorine enrichment”. Saline HDFs 
trapped in fibrous diamonds from the central Slave craton are a deeper 
manifestation of this lithospheric hydration process, expressed as young 
diamond formation in the CLM root. The full spectrum of HDF com- 
positional varieties, from saline, silicic and carbonatitic, are present in 
fibrous diamonds from various cratonic roots®”!*"’, including inter- 
mediate composition between saline to silicic found in an eclogitic 
zoned diamond from Guinea’ (Extended Data Fig. 5). We suggest that 
deep mantle saline fluids are directly related to subduction events—they 
are key metasomatic agents from which the whole spectrum of dia- 
mond-forming fluids evolve, and they play a major part in impacting 
the composition of the deep lithospheric mantle globally. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Samples and methods. A suite of eleven diamonds from the Ekati mine, Slave 
Craton, Canada, was selected for EPMA, Fourier-transform infrared (FTIR) and 
off-line laser ablation inductively coupled plasma mass spectrometry (ICP-MS) 
analyses. The diamonds have a large range in size with weight varying between 
3-83 mg. Each diamond was laser-cut and polished on both sides to create a thin 
slab that permits the transmittance of light. It was then cleaned ultrasonically 
in HF 60% and HNO; 69% for 2h and washed with ethanol and distilled water 
before analysis. 

FTIR. Analyses were performed using a Bruker IRscope II microscope coupled to 
a Nicolet 740 FTIR spectrometer (Globar source, KBr beamsplitter, MCT detector, 
He-Ne laser). Spectra were taken in the range of 550-4,000 cm — | with a resolution 
of 4cm7'. Nitrogen concentration and aggregation states were determined using a 
computer program supplied by D. Fisher and the absorption coefficients of A 
centres (double substitution of carbon by two nitrogen atoms, type IaA spectrum), 
B centres (clusters of four nitrogen atoms substituting five carbon atoms, type 
IaB spectrum) and C centres (single nitrogen replacing a carbon atom, type Ib 
spectrum)*!**, After baseline correction and subtraction of the diamond bands, the 
concentrations of water and carbonate were determined using the maximum absor- 
bance of water and carbonate and their absorption coefficients’. These concentra- 
tions were used to calculate the carbonate mole fraction (CMF = carbonate/ 
(water + carbonate) molar ratio) of the trapped fluids (Supplementary Table 1). 
EPMA. The major element compositions of the microinclusions were determined 
using a JEOL JXA 8600 EPMA equipped with a Pioneer-Norvar EDS (133 eV) 
detector. Backscattered electron imaging was used to detect shallow, subsurface 
microinclusions (<2 1m depth). Each inclusion was analysed for 100s using an 
acceleration voltage of 15 kV and a beam current of 10 nA. The spectral data were 
reduced using the ZAF/PROZA correction procedure software supplied by 
Noran”*. The total amount of oxides and Cl in each analysis varied between 1 
and 12.4 wt% with an average of 3.3 wt% for all 327 analysed HDF microinclusions 
and between 1.8 and 78 wt% with an average of 11 wt% for 68 analysed mineral 
microinclusions. Precision (20 (%) = 2 X 1/oxide in wt%) is <20% for oxide 
concentrations of 0.05 wt%, <10% for 0.25 wt%, <6% for 0.5 wt% and <2% for 
1 wt% (M. Jablon and O. Navon, unpublished data). The low and variable sums 
reflect the small size of the inclusions, their depth and their high content of 
undetected water and carbonates. The ZAF/PROZA processing assumed that 
the difference to 100 wt% is composed of pure carbon. Later, all oxide and chlorine 
concentrations were normalized to 100 wt% on a carbon-free and volatiles-free 
basis (where Cl is present, excess calculated oxygen leads to a normalized total of 
more than 100%) and the average composition of the HDF in the diamond was 
calculated. 

Offline laser ablation. Diamonds were ablated in a custom-designed, sealed PTFE 
ablation cell capped with a laser window that had been previously cleaned 
with UpA 6 N HCl and 2 N HNO;. Ablations were performed with a UV-213 
New-Wave Laser ablation system, with the custom cell replacing that provided 
by the manufacturer. A pre-weighed diamond was brought into focus and an 
ablation was performed using a raster pattern. Ablation conditions were: scan 
speed 50 ums _'; raster spacing 80 jum; energy output 5-6 J cm *; repetition rate 
20 Hz; spot size 160 jim; and pass depth 2 im. Ablation time varied from 3-5 h. 
After ablation, the laser cell is opened in an ultraclean environment and all ablated 
material was collected in UpA 6 N HCl before being dried down before further 
chemistry. The diamond was rinsed in MQ water and dried. Diamonds were 
re-weighed and the weight loss (0.32-0.71 mg) resulting from the ablation 
was calculated. Weighing uncertainty is +0.0007 mg, estimated from 100 repeat 
weighs of both a gem-quality and a fibrous diamond. The dried ablation product 
was taken up in 2 N HNO. A 20% aliquot is taken by volume for trace element 
analysis. The remaining sample was processed for Sr isotopic analysis. The Sr 
separation procedure is based on the method described previously”, using Sr-spec 
resin but with modifications as outlined’* for sub-ng samples. 

Quantifiable data and background corrections. We use the limit of quantifica- 
tion (LOQ), as defined previously”, as a measure of our ability to quantitatively 
measure elemental abundances because this parameter is considerably more 
robust than defining the ‘limits of detection’, or LOD, which merely define the 
ability to qualitatively detect an analyte. The LOQ for a procedure with a well- 
characterized blank is defined*? as: LOQ = 10a, where a is the standard deviation 
of the blank for the process (here defined as the total procedural blank (TPB)). This 
approach places clear limits on our ability to quantitatively report concentration 
data in the diamonds studied. We use a data set of 20 TPBs performed using the 
same ablation cells and reagents as used for samples, to determine the LOQ for 
trace element abundances. Within each batch of samples, between five and 
ten additional TPBs were also run to monitor whether our LOQ estimate was 
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applicable from one batch of samples to another. Any analyte below the LOQ 
is flagged in the data and not used on a concentration plot. In the previous 
definition’’, data can only be quantitative if it exceeds 10¢ of the blank, hence 
the analyte/blank ratio is a critical parameter to measure. The total amount of 
analyte and hence the analyte/background ratio is simply a function of the length 
of the ablation, with the ratio increasing with time. 

Multi-element ICP-MS. TPBs and aliquot sample solutions were analysed for 
trace element concentrations on the Thermo-Electron Element II ICPMS at 
Durham University. Each sample aliquot was made to 500 pl with 3% HNO3. 
Instrumental conditions were similar to those described previously”. Solution 
concentrations were measured against 9-point calibration lines constructed from 
appropriately dilute solutions of the international standards AGV-1, BHVO-1 and 
W-2. All concentrations were corrected for instrument drift using an 115In internal 
spike. Oxide correction coefficients were determined by running standard solutions 
of Ba, La, Ce, Pr, Nd, Sm, Gd and Tb at the beginning of each analytical session to 
correct for the daily changes in the oxide production rate. All trace element con- 
centrations were normalized to the diamond weight loss during ablation 
Thermal ionization mass spectrometry. With each batch of samples processed 
for isotopic analysis, between five and ten TPBs were carried out to determine the 
average size of the blank contribution and its effect on the isotopic composition of 
the sample. During the course of this study Sr blanks averaged 5 pg (n = 12). A Sr 
isotope blank correction was performed using a measured blank isotopic com- 
position based on combining the equivalent of over 60 TPBs to yield sufficient Sr 
(~500 pg) for a precise and accurate thermal ionization mass spectrometry 
(TIMS) analysis. The average *’Sr/*°Sr composition of the laboratory blank during 
the course of this work was 0.710853 + 0.000194 and all Sr samples were blank- 
corrected based on this value and the average blank set at 5 pg. Sr samples were 
loaded using procedures described in detail previously’, employing a purified 
TaF; activator. Sr isotope ratios were measured on a ThermoFisher Triton TIMS at 
Durham University. Sr isotope measurements were carried out using a static 
multi-collection routine. Each sample measurement achieved between 50 and 
300 ratios with an integration time of 4 s per ratio; total analysis time was approxi- 
mately 3-20 min. Mass fractionation was corrected using an exponential law and 
an *°Sr/*8Sr ratio of 0.1194. Multiple loads (n = 43) of NBS987 of between 0.5 and 
3 ng size gave an average value of 0.710260 + 0.00002 (2 standard deviations; 
n= 43), which compares well to the long-term values reported from the 
Durham University laboratory for similar sized standards from the same labor- 
atory*”**"1, As the Durham laboratory reports Sr data relative to an °’Sr/°°Sr 
ratio of 0.710240 no additional normalization was performed. Average signal size 
for the **Sr for the 0.5ng and 3ng standards were 0.8 +0.4V and 5+1.3V, 
respectively. Signal sizes for samples were on average 0.2 + 1 V. We have prev- 
iously documented in detail the levels of accuracy and repeatability for samples 
and standards at these low signal intensities**. There is no systematic relationship 
between analyte size and Sr isotope composition after blank correction. Hence we 
conclude that our blank correction procedures adequately correct for our system- 
atic TPB. Uncertainties in the magnitude and isotopic composition of the blank are 
incorporated into the reported errors on isotopic compositions at the 2¢ level. 
Previous experiments** indicate that for blanks of ~5 pg, it is possible to make 
accurate blank corrections to samples containing as little as 20 pg and therefore 
that level was used as a cut-off for accepting accurate data in this study because 
similar levels of blank reproduction were achieved. 


31. Boyd, S.R., Kiflawi, |. & Woods, G. S. The relationship between infrared 
absorption and the A defect concentration in diamond. Philos. Mag. B 69, 
1149-1153 (1994). 

32. Boyd, S.R., Kiflawi, |. & Woods, G. S. Infrared absorption by the B nitrogen 
aggregation in diamond. Philos. Mag. B 72, 351-361 (1995). 

33. Kiflawi,1., Myers, A.E., Spear, P.M., van Wyk, J. A. & Woods, G. S. Infrared absorption 
by the single nitrogen and A defect centres in diamond. Philos. Mag. B 69, 
1141-1147 (1994). 

34. Zaitsev, A. M. Optical Properties of Diamond Handbook 502 (Springer, 2001). 

35. Weiss, Y. Kiflawi, I. & Navon, O. IR spectroscopy: quantitative determination of the 
mineralogy and bulk composition of fluid microinclusions in diamonds. Chem. 
Geol. 275, 26-34 (2010). 

36. Bastin, G. & Heijligers, J. in Electron Probe Quantitation, Workshop at the National 
Bureau of Standards, Gaithersburg, Maryland (eds Heinrich, K. & Newbury, D.) 
145-161 (Plenum, 1991). 

37. Charlier, B.L.A.eta/. Methods for the microsampling and highprecision analysis of 
strontium and rubidium isotopes at single crystal scale for petrological and 
geochronological applications. Chem. Geo/. 232, 114-133 (2006). 

38. Harlou, R., Pearson, D. G., Nowell, G. M., Ottley, C. J. & Davidson, J. P. Combined Sr 
isotope and trace element analysis of melt inclusions at sub-ng levels using micro- 
milling, TIMS and ICPMS. Chem. Geol. 260, 254-268 (2009). 

39. Currie, L.A. Limits for qualitative detection and quantitative determination. 
Application to radiochemistry. Anal. Chem. 40, 586 (1968). 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


40. 
Al. 


42. 
43. 


Font, L., Nowell, G. N., Pearson, D. G., Ottley, C. J. & Willis, S. G. Sr. isotope analysis of 
bird feathers by TIMS: a tool to trace bird migration paths and breeding sites. 

J. Anal. At. Spectrom. 22, 513-522 (2007). 

Klein-BenDavid, O. et al. Mixed fluid sources involved in diamond growth 
constrained by Sr-Nd-Pb-C-N isotopes and trace elements. Earth Planet. Sci. Lett 
289, 123-133 (2010). 

Stachel, T. & Harris, J. W. The origin of cratonic diamonds—constraints from 
mineral inclusions. Ore Geol. Rev. 34, 5-32 (2008). 

Taylor, W. R., Canil, D. & Milledge, J. Kinetics of Ib to laA nitrogen aggregation in 
diamond. Geochim. Cosmochim. Acta 60, 4725-4733 (1996). 


44. 


45. 
46. 


Taylor, W. R., Jaques, A. L. & Ridd, M. Nitrogen-defect aggregation characteris- 
tics of some Australasian diamonds: time-temperature constraints on 

the source regions of pipe and alluvial diamonds. Am. Mineral. 75, 1290-1310 
(1990). 

Klein-BenDavid, O., Izraeli, E. S., Hauri, E. & Navon, O. Mantle fluid evolution—a tale 
of one diamond. Lithos 77, 243-253 (2004). 

Sarkar, C., Heaman, L. M. & Pearson, D. G. Duration and periodicity of kimberlite 
volcanic activity in the Lac de Gras kimberlite field, Canada and some 
recommendations for kimberlite geochronology. Lithos 218-219, 155-166 
(2015). 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


Olivine 
Saline HDF 


{\ 
[yx 
NG 


@ Cpx 
Olivine 
A Saline HDF 


Omphacite 
_. Silicic HDF 


Extended Data Figure 1 | Photomicrographs and cathodoluminescence images of three diamonds from the Fox kimberlite. a, b, Diamond E154; 
c, d, diamond E217; e, f, diamond E142. The location of the microinclusions analysed by EPMA is imposed on the cathodoluminescence images. 
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Extended Data Figure 2 | Infrared spectra of six fibrous diamonds from the 
Fox kimberlite. Absorbance bands of the diamond itself, nitrogen in the 
diamond lattice, and carbonate and water in the microinclusions are observed 
in all diamonds. In diamond E221 olivine absorption due to the presence of 
olivine microinclusions is also observed, while in diamonds E142 and E11014 
the presence of pyroxene (omphacite) is detected. The nitrogen in all Fox 
fibrous diamonds is aggregated in 100% A centres. The nitrogen defect 
structure in diamonds evolves over time, from individual nitrogen atoms 
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replacing single carbon atoms (C centres) when the diamond forms, to pairs of 
adjacent nitrogen atoms (A centres), to four nitrogen atoms tetrahedrally 
arranged around a vacancy (B centres). At the mantle temperatures at which 
most diamonds form (that is, 950-1,250 °C)”, the full transformation from 

C to A centres (100% A centres) will form over a timescale of hundreds 

of thousands to millions of years, while A centres merge to form B centres 
over billions of years. 
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Extended Data Figure 3 | Microinclusion spatial location and MgO and point was radially normalized to allow plotting of the compositional change 
Cl composition in diamond ON-DVK-294 (refs 7, 45). a, The data point of the fluids along a uniform diamond profile. Clear chemical differences 
locations are superimposed on the original figure 3 from ref. 45 (reproduced _ between the inner and outer parts of this diamond exist; however, the transition 
with permission). The shift in the position of a few of the data points is related between the two zones in the diamond shows a gradual evolution rather 

to errors in the position of fiducial marks. b, The position of each data than an abrupt change. 
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Extended Data Figure 4 | Total N in parts per million (ppm) versus 
percentage of B centres for fibrous diamonds from the central Slave 
lithosphere. Isochrons are calculated for ambient temperatures of 930 °C 
(dotted red lines) and 1,010 °C (solid black lines) as described previously**™*. 
Fibrous diamonds in the Slave lithosphere have 600-1,600 ppm nitrogen and 
are essentially of pure IaA type (that is, %B = 0). The presence of a very small 
quantity of B centres (<1) is difficult to evaluate from FTIR spectroscopy but 
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such centres are necessary for calculating a possible age based on nitrogen 
aggregation. The geothermometry results for mineral inclusions in fibrous 
diamonds from the Panda kimberlite yielded low equilibrium temperatures of 
930-1,010 °C (ref. 5). Applying these temperatures to the amount of nitrogen in 
the central Slave diamonds and %B ~ 0 (grey area), as indicated from FTIR 
spectroscopy, reasonable mantle residence times for these diamonds vary 
between 1-210 Ma, before the kimberlite eruption age of 55 Ma (ref. 46). 
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Extended Data Figure 5 | MgO and SiO, against Cl content, all in wt%, 
water- and CO2-free composition, of HDF microinclusions in fibrous 
diamonds from the central Slave craton and diamond ON-KAN-383 from 
Kankan, Guinea. Intermediate compositions between saline to silicic are 
not unique to the PAN4 diamond’. The composition of 21 microinclusions 
with similar composition in an eclogitic zoned coated diamond from 
Kankan, Guinea (ON-KAN-383) was previously reported’. The saline-to-silicic 
inclusions in diamond ON-KAN-383 are restricted to the core-coat boundary 
of this diamond and develop, progressing towards silicic compositions 


(inner, middle) that finally evolve to low-Mg carbonatitic HDFs close to the rim 
of the diamond (outer). The paragenesis of this diamond and its analogous 
HDF compositions to eclogitic diamonds from the central Slave craton 
provide strong evidence for broader geographical involvement of saline fluids 
in the formation of silicic melts in eclogite, as well as for the continuous 
evolution (shaded red arrow) from saline through silicic to low-Mg carbonatitic 
composition when saline HDFs intersect with hydrous-carbonated eclogite 
lithology in the CLM root. 
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Reconstructing the reproductive mode of an 
Ediacaran macro-organism 


Emily G. Mitchell’, Charlotte G. Kenchington'”, Alexander G. Liu’, Jack J. Matthews* & Nicholas J. Butterfield! 


Enigmatic macrofossils of late Ediacaran age (580-541 million 
years ago) provide the oldest known record of diverse complex 
organisms on Earth, lying between the microbially dominated 
ecosystems of the Proterozoic and the Cambrian emergence of 
the modern biosphere’. Among the oldest and most enigmatic of 
these macrofossils are the Rangeomorpha, a group characterized 
by modular, self-similar branching and a sessile benthic habit”*. 
Localized occurrences of large in situ fossilized rangeomorph 
populations allow fundamental aspects of their biology to be 
resolved using spatial point process techniques’. Here we use such 
techniques to identify recurrent clustering patterns in the rangeo- 
morph Fractofusus, revealing a complex life history of multige- 
nerational, stolon-like asexual reproduction, interspersed with 
dispersal by waterborne propagules. Ecologically, such a habit 
would have allowed both for the rapid colonization of a localized 
area and for transport to new, previously uncolonized areas. The 
capacity of Fractofusus to derive adult morphology by two distinct 
reproductive modes documents the sophistication of its under- 
lying developmental biology. 

Late Ediacaran sedimentary strata (~580-541 million years ago 
(Ma)) of Newfoundland and the UK are dominated by rangeomorphs, 
whose unique self-similar branching construction’ makes resolution 
of their phylogenetic relationships, or even their basic biology, dif- 
ficult’. The occurrence of rangeomorphs in conspicuously deep-water 
sediments has led to a consensus that they were heterotrophic’, while 
the global distribution of charniids (a rangeomorph sub-group) has 
been interpreted as evidence for reproduction via waterborne propa- 
gules’. In the present study we use spatial statistics and modelling®* ina 
novel approach to illuminate the reproductive biology and underlying 
ecology of one of the most abundantly preserved rangeomorph fossils, 
Fractofusus’. 

We analysed three large bedding-plane assemblages of Fractofusus 
in southeast Newfoundland: (1) the ‘D’ surface and (2) the ‘E’ surface at 
Mistaken Point, Avalon Peninsula;”'® and (3) the H14 surface on 
Bonavista Peninsula (locality 14 of ref. 11) (Extended Data Fig. la-c). 
A volcanic tuff directly above the ‘E surface has been dated to 
565 + 3 Ma (ref. 12), which also constrains the age of the underlying 
‘D’ surface. Regional lithostratigraphic correlations suggest that the 
H14 surface is a few million years younger than the Mistaken Point 
beds'’. All three assemblages occur within deep-marine turbidite 
sequences, with Fractofusus fossils preserved as negative epirelief 
external moulds in siltstone hemipelagites, cast from above by volca- 
niclastic deposits®. 

Fractofusus is conspicuously endemic, restricted almost exclusively 
to southeastern Newfoundland”, where it dominates many macrofos- 
sil assemblages'®. Fractofusus has a rounded, elongate spindle-like 
morphology, with two (arguably three”’’) offset rows of irregularly 
alternating, self-similar, subdivided frondlets arranged along a central 
axis*"*, Fractofusus specimens range from 1cm to 42cm in length? 
(Fig. la, b); two species have been described, distinguished by their 
length:width (L/W) ratios”. The ‘D’ and ‘E’ surfaces are dominated by 


the elongate form, Fractofusus misrai (L/W = 3.2; Fig. 1b), whereas the 
more ovate Fractofusus andersoni (L/W = 1.6; Fig. 1a) dominates the 
H14 surface’. Fractofusus occurs in dense benthic populations and 
exhibits no evidence of motility or current orientation’. Together with 
nearest neighbour spatial analyses’®, these observations point to a 
sessile, recumbent, benthic mode of life in aggregated communities. 
The spatial positions of Fractofusus were mapped to millimetre-scale 
resolution using differentiated Global Positioning System (GPS) 
(Extended Data Fig. 1d, e) on the two surfaces at Mistaken Point, and 
by tracing specimen outlines onto acetate sheets at H14; importantly, 


Figure 1 | Fractofusus specimens from Newfoundland, Canada. 

a, F. andersoni specimen from the H14 surface. b, F. misrai from the ‘E’ surface, 
showing a large size-class partial specimen (~20 cm, above) alongside a 
small size-class specimen (3.5 cm in length, below). Scale bars, 1 cm. 
Photographs are non-retrodeformed. 
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this latter approach also allowed size data to be recorded (Extended 
Data Fig. 1f). The ‘D’ and ‘E’ surface data were corrected for tectonic 
deformation before analysis (Extended Data Fig. 2)’. Heterogeneous 
Poisson models were used to identify possible distortions arising from 
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Figure 2 | PCF for mapped taxa. For all plots the x axis is the inter-point 
distance between organisms in metres. On the y axis, PCF = 1 indicates 

CSR, <1 indicates segregation and >1 indicates aggregation. a, PCF for 
Fractofusus on the ‘D’ surface (1,040 specimens), ‘E’ surface (1,141 specimens) 
and H14 surface (1,214 specimens). Grey shaded area depicts the bounds of 
99 Monte Carlo simulations of CSR. Since the PCF curves are not completely 
within these areas, the CSR hypothesis is rejected and one can assume that 
the Fractofusus distributions on all three surfaces form cluster patterns 
(p?a<0.01, p®a < 0.01, p'!*4 < 0.01). b, PCF for non-CSR ‘EF surface taxa 
(charniid 76 specimens, Charniodiscus 326 specimens, Primocandelabrum 311 
specimens and Thectardis 39 specimens). Grey shaded area depicts 99 Monte 
Carlo simulations of the best-fit H14 surface model of double Thomas 
cluster process. Note how the ‘E’ surface Fractofusus PCF follows the H14 
surface PCF very closely, and can be modelled by the same process (pg = 0.51). 
Other ‘E’ surface taxa have markedly different PCF to the Fractofusus PCF. 

c, PCF for the three size-classes of Fractofusus on H14 surface. Grey shaded 
area depicts the 99 Monte Carlo simulations of CSR. The large size-class 

(350 specimens) exhibits CSR (pg = 0.30); the intermediate size-class 

(310 specimens) shows aggregation <0.10 m (single Thomas cluster model, 
Pa = 0.51). The small size-class (554 specimens) shows a large aggregation 
<0.08 m and a lesser aggregation between 0.08 m and 0.20 m (double Thomas 
cluster model, pa = 0.72). 


344 | NATURE | VOL 524 | 20 AUGUST 2015 


differential erosion of the bedding planes (Supplementary Table 1). 
Pair correlation functions (PCFs) were calculated to describe the spatial 
distributions of taxa on each bedding plane (Fig. 2a)’. Monte Carlo 
simulations’ and Diggle’s goodness-of-fit test® (the p-value pg, in which 
Pa = 1 indicates a perfect model fit, and pa = 0 indicates no fit), were 
used to compare the fit of different spatial models to the data (specif- 
ically homogeneous and heterogeneous Poisson models’® and single 
and double homogeneous and heterogeneous Thomas cluster mod- 
els)'®. PCFs were also used to describe the spatial distributions of taxa 
other than Fractofusus on the ‘D’ and ‘E’ surfaces (Fig. 2b). For the H14 
surface, spatial relationships between three distinct Fractofusus size- 
classes (defined in Methods; Extended Data Fig. 3a, b) were analysed 
by calculating partial PCFs° and comparing model fit of bivariate shared 
parent (SP) models with linked cluster models (LCMs) (Fig. 2c)’%. 
Finally, spatial directionality was investigated by plotting the general- 
ized K-functions”” of Fractofusus specimens from H14 surface from 0° 
to 360° (isotropy plots), allowing visualization of the relative directional 
positions of specimens (Fig. 3). 

Non-random spatial distributions of sessile organisms, namely 
those that do not exhibit complete spatial randomness (CSR), can be 
explained by either extrinsic factors (for example, environmental het- 
erogeneities), or intrinsic reproduction’’. Identifying the processes 
behind such patterns is not straightforward; however, extrinsically 
induced patterns are generally best modelled by heterogeneous 
Poisson models'*, which describe randomly distributed points with a 
non-uniform density across the sampled area. In contrast, intrinsic 
processes typically generate Thomas cluster models'’, where the points 
within each cluster have a normal density distribution centred on a 
parent point. 

All three populations of Fractofusus were found to be significantly 
aggregated, conforming closely to homogeneous Thomas cluster mod- 
els (Fig. 2a). Specimens on the ‘E’ and H14 surfaces are aggregated at 
two spatial scales, forming clusters of clusters (Fig. 2a, b). On the ‘E’ 
surface, this distribution is best modelled by a nested homogeneous 
double Thomas cluster model of 23 clusters (radius r= 0.242 m), 
each containing 12 smaller clusters (r= 0.074m) of 3 specimens 
(pa = 0.76). The H14 surface distribution is best modelled by a nested 
homogeneous double Thomas cluster model of 24 large clusters 
(r = 0.237m), each containing 6 clusters (r = 0.079 m) of 8 specimens 
(pa = 0.89). The ‘D’ surface distribution forms discrete clusters (not 
clusters of clusters), which are best modelled by a single Thomas 
cluster model (pg = 0.77) with 338 Fractofusus clusters of 3 specimens 
(r = 0.086 m) (Extended Data Tables 1 and 2). Importantly, the spatial 
distribution on the ‘E’ surface can also be modelled by the same 
nested double cluster pattern as found on the H14 surface (Fig. 2b) 
(pa'' °" ¥ = 0.51), strongly implying the same underlying process for 
both distributions (Fig. 2b and Extended Data Table 3). The spatial 
distribution of Fractofusus on the ‘D’ surface is conspicuously similar 
to that seen in the larger specimens on H14 (Extended Data Fig. 4e). 
By contrast, the spatial distributions of other taxa—Thectardis, 
Primocandelabrum and Charniodiscus—exhibit fundamentally differ- 
ent magnitudes and spatial scales of aggregation, both to each other 
and to those of Fractofusus (Fig. 2b and Extended Data Tables 4 and 5). 

The close fit of Fractofusus spatial distributions to single and 
nested double Thomas cluster models strongly suggests that they 
derive from reproductive rather than extrinsic (environmental) 
factors. Reproductive biology is further corroborated by size analysis 
of the Fractofusus population on the H14 surface (Fig. 2c and Extended 
Data Fig. 4a, c-e), which reveals strikingly different spatial patterns 
for each of the three size-classes (Fig. 2c, Extended Data Tables 1 
and 2). Whereas the largest size-class (>11.0 cm) is randomly distrib- 
uted (pg = 0.30), both the intermediate (5.5-11.0cm) and smallest 
(<5.5 cm) size-classes are hierarchically clustered: small individuals 
cluster around intermediate individuals (pg'©“ = 0.74 versus 
i = 0.03; Extended Data Table 5), which in turn cluster around 
large individuals Ga = 0.66 versus pa = 0.01). In other words, 
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Figure 3 | Isotropy plots from the H14 surface for each size-class of 
Fractofusus, providing a visualization of specimen positions relative to one 
another. The vertical axis on each part depicts the colour map of specimens 
per square metre normalized to account for different densities between size- 
classes. A peak (>1) is shown in green or yellow and depicts aggregation, while 
a dip (<1) is shown in blue and depicts segregation. If there are no directional 


the smallest specimens form clusters (homogeneous nested double 
Thomas cluster model, pg = 0.72) around intermediate-sized speci- 
mens (homogeneous single Thomas cluster model, pg = 0.51), which 
are themselves clustered around randomly distributed large specimens 
(homogeneous Poisson model, pg = 0.31; Figs 2c and 4, Extended Data 
Fig. 5 and Extended Data Tables 1, 2 and 5). Moreover, the isotropy 
plots for H14 (Fig. 3) show strong directionality for the large size-class, 
but limited directionality for the medium and small size-classes. The 
nested clusters on the ‘E’ and H14 surfaces suggest three generations, 
while the single clusters on the “D’ surface suggest two, reflecting an 
earlier stage in population development. The difference is consistent 
with the suggestion that the “D’ surface records an earlier stage in the 
ecological succession of Ediacaran macroscopic communities’ 
(Supplementary Information section 2.3). 

As with other Ediacaran macrofossils, there is no direct fossil evid- 
ence of reproductive habits in Fractofusus, but its recurrent distri- 
bution on bedding surfaces provides a statistically robust approach 
for inferring the underlying processes’’. In modern oceans, large sessile 
organisms typically reproduce by means of waterborne propagules, 
fragmentation/budding and/or stolons (that is, production of asexual 


Figure 4 | Schematic diagram showing simplified Fractofusus spatial 
arrangements. The actual number of clusters, and clusters within those 
clusters, is higher than shown (23 clusters each containing 12 clusters of 3 
specimens on the H14 surface), making their direct visual detection 
challenging. No overlapping specimens are shown because, while the best-fit 
models allow for overlaps, the observed PCF between the small size-class 
(Extended Data Fig. 4c, d) and the large size-class (Fig. 2b) shows a 

small segregation (<3 cm) away from the model behaviour, and a similar, 
non-significant segregation for the large size-class. 


effects then the colour map in every direction from the centre point should be 
similar. a, The large size-class shows strong anisotropy, with aggregation of 
up to four normalized specimens per square metre. b, c, In contrast the medium 
(b) and small (c) size-classes show isotropy, namely a relative evenness of 
aggregations with a maximum density variation up to 0.5 normalized 
specimens per square metre. 


clones that are at least initially connected to the parent by specialized 
outgrowths). 

Spatial distribution of waterborne propagules—including both sexual 
and asexual spores, as well as sub-millimetre buds and fragments—is a 
function of current and rate of sinking. Even with rapid sinking 
(~1mms_‘)° and slow currents (~1cms''), propagules released 
from the dorsal surface of a ‘parental’ Fractofusus (~2-3cm above 
the substrate) would have been current aligned”! and dispersed by deci- 
metres or more”. Slow descent times also correspond with right-skewed 
(mean greater than the median) density distributions*””’. The random 
spatial distribution of the H14 largest size-class most probably reflects a 
large dispersal distance (Extended Data Table 1), which, coupled with its 
highly directional isotropy plot (Fig. 3a), indicates that the largest speci- 
mens were strongly influenced by currents (cf. refs 7, 10). As such, they 
probably derive from waterborne propagules and represent the initial 
establishment of a Fractofusus population on this surface. 

The hierarchically clustered bedding plane distributions of small 
and medium Fractofusus on H14 closely match patterns exhibited 
by organisms reproducing asexually via stolon-like lateral extensions 
(Extended Data Fig. 5)”. Cluster distributions of the small and med- 
ium size-classes are also highly left-skewed (median greater than the 
mean), with the mean distance from each ‘parent’ to their ‘offspring’ 
on the order of a few centimetres (Fig. 2a), and offspring exhibiting no 
significant directionality or current orientation (Fig. 3b-c, Extended 
Data Fig. 3c). The reproducibility of the model distributions across the 
three bedding-plane assemblages further attests to the indifferent 
effects of current: the spatial distributions of non-tethered offspring 
would result in patterns dependent on current velocity, which are 
unlikely to be consistent across multiple bedding planes in different 
localities. Moreover, there are no recorded instances of buds or frag- 
mentary specimens of Fractofusus in any of its 5,000+ documented 
specimens’”®''*4 (see Supplementary Information section 2.1 and 
2.5). As such, the Fractofusus clusters on the H14 surface are not 
consistent with waterborne propagules or fragmentation/budding, 
but are directly comparable to stolon-like reproduction. Other taxa 
exhibit an intriguing range of non-random habits, and our prelim- 
inary analyses indicate that Primocandelabrum and Charniodiscus 
may have also reproduced using stolons. 

Reproductive biology lies at the core of ecological and evolutionary 
dynamics, and its positive identification in Ediacaran macrofossils has 
the potential to illuminate the beginnings of the modern marine bio- 
sphere. Previous studies of Ediacaran macrofossils have investigated 
the seasonality of reproduction’, identified putative stolons” and 
inferred sexual or asexual reproduction on the basis of biogeographic 
distribution or qualitative description of local populations”"°?””. In 
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the case of phosphatized ‘embryo’ microfossils, internal cell packages 
have been interpreted as evidence of germ-soma differentiation’, but 
it remains to be seen how those fossils relate to the evolution of large 
and/or complex eukaryotes. The identification in Fractofusus of a 
multigenerational asexual clonal phase, interspersed with the release 
of waterborne propagules, is the first statistically robust account of 
reproductive life history reported in an Ediacaran macrofossil. Such 
a strategy would have allowed for the rapid exploitation of localized 
areas, as well as for transport to new, previously uncolonized areas. The 
conclusion that Fractofusus could switch between reproductive modes 
further reveals the sophistication of its underlying developmental pro- 
gramme, capable not only of tissue differentiation, but also the gen- 
eration of new macroscopic individuals both from benthic stolons and 
from waterborne propagules. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 


Received 27 February; accepted 8 June 2015. 
Published online 3 August 2015. 


1. Liu, A. G, Kenchington C. G. & Mitchell, E. G. Remarkable insights into the 
paleoecology of the Avalonian Ediacaran biota. Gondwana Res. 27, 1355-1380 
(2015). 

2. Gehling,J.G.& Narbonne, G. M. Spindle-shaped Ediacara fossils from the Mistaken 
Point assemblage, Avalon Zone, Newfoundland. Can. J. Earth Sci. 44, 367-387 
(2007). 

3. Narbonne, G. M. Modular construction of early Ediacaran complex life forms. 
Science 305, 1141-1144 (2004). 

4. Hoyal Cuthill, J. F. & Conway Morris, S. Fractal branching organizations of 
Ediacaran rangeomorph fronds reveal a lost Proterozoic body plan. Proc. Nat! 
Acad. Sci. USA 111, 13122-13126 (2014). 

5. Illian, J., Penttinen, A., Stoyan, H. & Stoyan, D. Statistical Analysis and Modelling of 
Spatial Point Patterns Vol. 70 John Wiley, 2008). 

6. Wood, D.A., Dalrymple, R. W., Narbonne, G. M., Gehling, J. G. & Clapham, M. E. 
Paleoenvironmental analysis of the late Neoproterozoic Mistaken Point and 
Trepassey formations, southeastern Newfoundland. Can. J. Earth Sci. 40, 
1375-1391 (2003). 

7. Darroch, S.A. F., Laflamme, M. & Clapham, M. E. Population structure of the oldest 
known macroscopic communities from Mistaken Point, Newfoundland. 
Paleobiology 39, 591-608 (201 

8. Wiegand, T., Gunatilleke, S., Gunatilleke, N. & Okuda, T. Analyzing the spatial 
structure of a Sri Lankan tree species with multiple scales of clustering. Ecology 88, 
3088-3102 (2007). 

9. Landing, E., Narbonne, G.M. & Myrow, P. (eds) Trace fossils, small shelly fossils and 
the Precambrian-Cambrian boundary. Bull. NY State Mus. 463, 1-81 (1988). 

10. Clapham, M.E., Narbonne, G. M. & Gehling, J. G. Paleoecology of the oldest known 
animal communities: Ediacaran assemblages at Mistaken Point, Newfoundland. 
Paleobiology 29, 527-544 (2003). 

11. Hofmann, H. J., O’Brien, S. J. & King, A. F. Ediacaran biota on Bonavista Peninsula, 
Newfoundland. Can. J. Paleontol. 82, 1-36 (2008). 

12. Benus, A. P. Sedimentological context of a deep-water Ediacaran fauna (Mistaken 
Point Formation, Avalon zone, eastern Newfoundland). Bull. NY State Mus. 463, 
8-9 (1988). 

13. Narbonne, G. M., Laflamme, M., Trusler, P. W., Dalrymple, R. W. & Greentree, C. 
Deep-water Ediacaran fossils from northwestern Canada: Taphonomy, ecology, 
and evolution. J. Paleontol. 88, 207-223 (2014). 


Ww 
SS 


346 | NATURE | VOL 524 | 20 AUGUST 2015 


14. Brasier, M. D., Antcliffe, J.B. & Liu, A.G. The architecture of Ediacaran fronds. 

Palaeontology 55, 1105-1124 (2012). 

15. Diggle, P. Statistical Analysis of Spatial Point Patterns 2nd edn (Arnold, 2003). 

16. Baddeley, A. & Turner, R. Practical maximum pseudolikelihood for spatial point 

patterns. Aust. NZ J. Stat. 42, 283-322 (2000). 

17. Chiu, S.N., Stoyan, D., Kendall, W. S. & Mecke, J. Stochastic Geometry and its 

Applications 3rd edn John Wiley, 2013). 

18. Lin, Y., Chang, L, Yang, K., Wang, H. & Sun, |. Point patterns of tree distribution 

determined by habitat heterogeneity and dispersal limitation. Oecologia 165, 

175-184 (2011). 

19. Droser, M. L. & Gehling, J. G. Synchronous aggregate growth in an abundant new 
Ediacaran tubular organism. Science 319, 1660-1662 (2008). 

20. Gaylord, B., Reed, D.C., Raimondi, P. T. & Washburn, L. Macroalgal spore dispersal 
in coastal environments: mechanistic insights revealed by theory and experiment. 
Ecol. Monogr. 76, 481-502 (2006). 

21. Shanks, A. L. Pelagic larval duration and dispersal distance revisited. Biol. Bull. 216, 
373-385 (2009). 

22. Gaylord, B., Reed, D., Raimondi, P., Washburn, L. & McLean, S. A physically based 
model of macroalgal spore dispersal in the wave and current-dominated 
nearshore. Ecology 83, 1239-1251 (2002). 

23. Araki, K., Shimatani, K. & Ohara, M. Dynamics of distribution and performance of 
ramets constructing genets: a demographic-genetic study in a clonal plant, 
Convallaria keiskei. Ann. Bot. (Lond.) 104, 71-79 (2009). 

24. Narbonne, G. M. & Gehling, J. G. Life after snowball: the oldest complex Ediacaran 
fossils. Geology 31, 27-30 (2003). 

25. Peterson, K. J., Waggoner, B. & Hagadorn, J. W. A fungal analog for Newfoundland 
Ediacaran fossils? Integr. Comp. Biol. 43, 127-136 (2003). 

26. Fedonkin, M. A. in Vendskaya Sistema 1, istoriko-geologicheskoe i 
paleontologicheskoe obosnovanie paleontologiya Vol. 1 (eds Sokolov, B.S. & 
lvanovskiy, A. B.) 70-106 (Nauka, 1985). 

27. Penny, A.M. et al. Ediacaran metazoan reefs from the Nama Group, Namibia. 
Science 344, 1504-1506 (2014). 

28. Yuan,X. etal. The Lantian biota: a new window onto the origin and early evolution of 
multicellular organisms. Chin. Sci. Bull. 58, 701-707 (2013). 

29. Hua, H., Chen, Z., Yuan, X., Zhang, L. & Xiao, S. Skeletogenesis and asexual 
reproduction in the earliest biomineralizing animal Cloudina. Geology 33, 
277-280 (2005). 

30. Chen, L. Xiao, S., Pang, K., Zhou, C. & Yuan, X. Cell differentiation and germ-soma 
separation in Ediacaran animal embryo-like fossils. Nature 516, 238-241 (2014). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements The Parks and Natural Areas Division, Department of 
Environment and Conservation, Government of Newfoundland and Labrador, provided 
permits to conduct research within the Mistaken Point Ecological Reserve in 2010, 
while the Department of Tourism, Culture and Recreation provided permits for 
palaeontological research on the Bonavista Peninsula in 2012. Access to both of the 
aforementioned fossil localities is by scientific research permit only. Contact the 
relevant Department listed above for further information. This work has been 
supported by the Natural Environment Research Council (grant numbers NE/ 
1005927/1 to C.G.K., NE/J5000045/1 to JJ.M., NE/LO11409/1 to A.G.L. and NE/ 
G523539/1 to E.G.M.), and a Henslow Junior Research Fellowship from the Cambridge 
Philosophical Society to A.G.L. We thank M. Laflamme for discussions on this 
manuscript. 


Author Contributions E.G.M. conceived the project, collected data on the ‘D’ and ‘E’ 
surfaces and ran the analyses. C.G.K., A.G.L. and JJ.M. collected data on the H14 
surface. All authors discussed the results and prepared the manuscript. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to E.G.M. (ek338@cam.ac.uk). 


©2015 Macmillan Publishers Limited. All rights reserved 


METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized. The investigators were not blinded to allocation during 
experiments and outcome assessment. 

Data collection. Fossil taxa*' and spatial positions on the Mistaken Point ‘D’ and 
‘E’ surfaces (Extended Data Fig. 1a, b, d and e) were recorded using differentiated 
GPS over 15 days. A fixed GPS transmitter was installed on a headland overlooking 
the bedding planes, and a portable GPS receiver was used to map 4,496 individual 
fossils over a total area of 123.7 m’ for both surfaces (Supplementary Information 
section 2.1); the mean accuracy of data points was 0.4 + 0.06 cm horizontally and 
0.82 + 0.11 cm vertically. The measured position of each specimen represents the 
mean of five separate GPS readings collected over 5 s; both the accuracy (standard 
deviation of the five readings) and the associated weather conditions were 
recorded for each reading. Mann-Whitney tests were used to compare the densi- 
ties of specimens recorded in differing weather conditions with the null hypothesis 
that the density should not depend on the weather conditions on the day of data 
collection. 

Fossil positions on the H14 surface, Bonavista Peninsula (Extended Data Fig. 
1f), were recorded by tracing the outline of each specimen onto 2m X 5 m acetate 
sheets. Cleavage and other geological features were also traced. These data were 
collected by three people, two holding the sheets in position and the third record- 
ing the data. Wind-induced slippage (affecting large-scale spatial relationships 
(>0.75 m) for three out of five of the sheets) was determined by measuring the 
differences between cleavage features crossing sheets, yielding a mean accuracy of 
1.47 + 0.26 cm along strike, and 1.53 + 0.08 cm parallel to dip. Over the 0.5m 
distance that the PCFs were calculated, these errors translate to 0.37 + 0.26cm 
along strike and 0.15 + 0.08 cm parallel to dip—substantially less than the 1 cm 
cells within which specimen densities were measured to calculate the PCFs 
(Methods: Testing for non-random spatial distributions). The sheet approach 
was used to map the H14 surface because it provided size data more efficiently 
than direct measurement plus GPS. 

Specimen identification. Specimens were recorded as belonging to 1 of 12 taxo- 
nomic groups of macrofossils, including two ‘bin’ groups”, as follows. 1, 
Bradgatia. 2, Pectinifrons. 3, Thectardis. 4, F. andersoni+ F. misrai. 5, 
Charniodiscus spinosus + Charniodiscus procerus. 6, ‘Feather dusters’. 7, 
Hiemalora. 8, Ivesheadiomorphs**. 9, Lobate discs. 10, Charnia ‘A’ + Charnia 
‘B’. (Charnia ‘A’ consists of Beothukis mistakensis**** (which dominates the ‘E’ 
surface) and Charnia masoni. Charnia ‘B is now reassigned as Trepassia wardae™. 
Charniid populations on Mistaken Point are dominated by Beothukis (only four 
individuals on the ‘E’ surface are true Charnia species), therefore direct compar- 
ison of data from this grouping with those from other taxonomic groups should be 
undertaken with caution.) 11, ‘Holdfast discs’ (all discoidal specimens of uncertain 
affinity, with or without associated stems, lacking sufficient detail to identify the 
taxon). 12, ‘Other species’ (rare forms that do not fall into any of the other groups; 
for example, Hapsidophyllas). 

Retrodeformation. The tectonically distorted data from the Mistaken Point 
surfaces were retrodeformed by returning elongated holdfast discs to a circular 
outline’. The ‘D’ surface (based on 13 specimens) showed a deformation factor of 
1.35 + 0.11 (R? = 0.92), and the ‘E’ surface (based on 12 specimens) 1.71 + 0.08 
(R* = 0.754), both within the previously measured range’”. In the absence of any 
obviously directional distortion or suitable deformation indicators, measurements 
taken from the H14 locality were not adjusted in this fashion. 

Data collection bias. The impact of mechanical weathering on the ‘D’ and ‘E’ 
surfaces was investigated by modelling the fossil distributions as heterogeneous 
Poisson processes. Fossils were originally covered in a thin layer of volcanic tuff, 
which has since been partly weathered away to expose the bedding planes, poten- 
tially inducing bias. If the density of a particular taxon is correlated to modern 
weathering features, then such processes are likely to be masking the true palaeon- 
tological distribution of the fossils. Initial data exploration and residual analysis of 
weathering effects was performed in R* using the package spatstat’’. Four covari- 
ates, corresponding to four potential erosion sources, were investigated as follows. 
(1) Across the bedding plane (south to north) x, which is differentially eroded by 
cliff fall and water runoff from a small stream on the northern side for the ‘E’ 
surface. (2) Along the three bedding planes (west to east) y, which are subject to 
differential erosion from wave action. (3) The southwestern corner xy, which is the 
first point of contact for most waves on the ‘E’ surface. (4) The height of the fossils 
above the troughs of the tectonic ripples on the ‘D’ and ‘E’ surfaces h, which is an 
inverse proxy for ash coverage. 

On each bedding plane, and for each parameter, the spatial density of fossils 
in relation to the parameter was plotted, along with the best-fit quadratic line. 
This best-fit line was then used to model the change of density compared 
with the covariant. The inhomogeneous models were tested primarily on the 
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non-retrodeformed data (since retrodeformation may mask any aggregation 
due to preservational bias), and verified by conducting similar tests on the retro- 
deformed data. Two different methods were used to compare the different 
inhomogeneous models: Kolmogorov-Smirnov tests were performed on quadrats 
of the data to investigate the distributions relative to the four covariates” 
(Supplementary Table 1), then the model fit was assessed using the model 
residuals”*. Model residuals assessed the fit of the model to the data by plotting 
Q-Qand smoothed residual plots. If the observed line in the Q-Q plot fell outside 
two standard deviations of the model, the model was rejected**. Akaike informa- 
tion criterion values” were used to compare the relative quality of the statistical 
models that fitted the data. 

Bias generated by differing light conditions was tested by comparing fossil 

densities in areas either side of a specific grid line: on one side the fossils were 
mapped under optimal conditions (clear skies and low-raking light), and on the 
other under sub-optimal conditions (cloud-cover with limited shadow-cast). The 
extent of ash erosion was similar across this grid line, so any measured differences 
in fossil density are likely to be an artefact of observer bias resulting from different 
weather conditions. The densities were then compared using a Mann-Whitney 
test. 
Testing for non-random spatial distributions. Initial data exploration, inhomo- 
geneous Poisson modelling and residual analysis were performed in R® using the 
package spatstat**. Programita*® was used to find distance measures and to per- 
form aggregation model fitting (described in detail in refs 40-42). PCFs were used 
to assess which spatial distributions did not exhibit CSR, where the PCF value 
reflects how many times more likely the distribution seen is aggregated (or segre- 
gated) compared with CSR, as follows. (1) A distribution map was plotted for 
individual taxa, with the surfaces split into a grid of 1cm X 1cm cells, within 
which the population density was calculated. (2). The smoothed PCF was calcu- 
lated with smoothing dependent on number of specimens for each taxon. A 3-cell 
smoothing was applied for Fractofusus (D, E and H14), 5 cells for Charniodiscus 
(E), Pectinifrons (D), Bradgatia (D and E) and Primocandelabrum (E), and 15 cells 
for Thectardis (E) and charniids (D and E). (3) Ninety-nine simulations were run 
for each taxon on a homogeneous background to generate simulation envelopes 
around the random (PCF = 1). Ninety-nine simulations were run (instead of 100, 
for example) so that the pg values could be measured in 0.01 increments. (4) Values 
for pq were calculated using Diggle’s goodness-of-fit test’. 

For those taxa found to exhibit excursions outside the simulation envelope, four 

types of process were then fitted to the data: heterogeneous Poisson process, 
Thomas single cluster processes on both homogeneous and heterogeneous back- 
grounds, and Thomas double cluster process. The resulting models were then 
compared to find the best model for each taxon. 
Complexities of assessing model fit. Testing for significance with spatial point 
data is more complicated than for classical statistics owing to lack of independence 
and variety of point pattern distributions’. Monte Carlo simulations provide a 
good assessment, but the simulation envelope does not necessarily correspond to a 
confidence interval’*, and runs the risk of type 1 error if the observed PCF falls near 
the edge of the simulation envelope’’. The size of simulation envelopes depends on 
the sample size, so that smaller sample sizes (such as the H14 large size-class of 350 
specimens) have a relatively large simulation envelope in contrast to the ‘D’, “E 
and H14 surfaces (all >1,000 specimens). A comparatively large simulation envel- 
ope reduces the likelihood that the null model (such as CSR) is rejected. 
Consequently, hypothesis testing needs to be further supplemented. We used 
Diggle’s goodness-of-fit test, which is a single test statistic!’ (pa) representing 
the total squared deviation between the observed pattern and the theoretical result 
across the studied distances. This test statistic was used in conjunction with visual 
inspection of Monte Carlo simulations for two reasons. First, pg does not strictly 
test whether a model should be accepted or rejected, but whether the PCFs for the 
observed data are within the range of the stochastic realization of the model”. 
Second, pa depends on the range over which it is calculated. For example, the 
model that best describes the ‘E’ surface data has pg = 0.56, which may appear low. 
However, inspection of the PCF (Extended Data Fig. 4b) shows a very close fit to 
the double Thomas cluster model above 2 cm (Extended Data Table 2). The finite 
size of Fractofusus is reflected in the lower PCF values at small distances, so the 
model is only fit above 2 cm. 

Interpreting ecological processes from spatial point patterns is imprecise. 
Different processes can produce similar spatial patterns*'****, with the complex 
interplay of intra- and interspecific interactions affecting organismal distribu- 
tions***, Even so, application of complementary statistical techniques, such as 
PCFs combined with comparisons of inhomogeneous Poisson and Thomas cluster 
models, and nearest neighbour distance analysis, offers the most effective means of 
teasing out the underlying ecological processes. 

Model fitting. If a taxon was not randomly distributed on a homogeneous 
background (Extended Data Table 1), the random model on a heterogeneous 
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background was tested. Six different heterogeneous backgrounds were generated, 
as follows (Extended Data Table 4). (1) The first heterogeneous background was 
created from the density map of the taxon under consideration, being defined by a 
circle of radius r over which the density is averaged throughout the sample area. 
Density maps were formed using estimators within the range of 0.1 m<r<1m, 
and the radius corresponding to the best-fit model was used. (2) The second 
heterogeneous background was created from density maps of all specimens on 
each surface combined. (3) The third to sixth heterogeneous backgrounds were 
created from the four separate density maps of Fractofusus, ivesheadiomorphs, 
Charniodiscus and Primocandelabrum. 

This procedure follows that used to test for a non-random distribution on a 
homogeneous background (Methods: Testing for non-random spatial distribu- 
tions), except at point 3, where the homogeneous background on which the taxa 
were simulated is replaced by a heterogeneous one. If excursions outside the 
simulation envelopes for both homogeneous and heterogeneous Poisson models 
remained, then cluster models were fitted to the data. For each non-random taxon, 
univariate cluster models were fitted as follows (Extended Data Table 1). (1) The 
PCF and L-function” of the observed data were found. Both measures were 
calculated to ensure that the best-fit model was not optimized towards only one 
distance measure, and thus encapsulated all spatial characteristics. (2) Best-fit 
Thomas cluster processes” were fitted to the two functions where PCF > 1. The 
best-fit lines were not fitted to fluctuations around the random line of PCF = 1, so 
as to aid good fit about the actual aggregations and to limit fitting of the model 
about random fluctuations. Programita used the minimal contrast method*"* to 
find the best-fit model*. (3) If the model did not describe the observed data well, the 
lines were refitted using just the PCF. If that fit was also poor, then only the 
L-function was used. (4) Ninety-nine simulations of this model were generated 
to create simulation envelopes, and the fit checked using the O-ring statistic*®. 
(5) The value of pg was calculated over the model range. Very small-scale segrega- 
tions (under 2 cm) were not included in the model fitting, since they probably 
represented the finite size of the specimens, and the lack of specimen overlap. (6) If 
there were no excursions outside the simulation envelope and the value of pg was 
high, then a univariate homogeneous cluster model was interpreted as the 
best model. 

Taxa exhibiting two scales of clustering were modelled as Thomas double 

cluster processes on a homogeneous background, and as single Thomas cluster 
processes on a heterogeneous background as follows (Extended Data Table 2). 
(1) The PCF was plotted and ranges for the two different scales of clustering were 
found. For example, the small-scale cluster may be 0 <r<0.5 m, and the large- 
scale cluster 0.5m <r< 1.5m. (2) The large-scale cluster model was fitted. (3) The 
parameters of the large-scale single cluster model were used as parameters for the 
large-scale clusters of the double cluster model. (4) Nearest-neighbour functions 
were calculated and compared with the parameters of the different-scaled clusters 
to test for nestedness. 
Comparison between and within taxa. To assess whether Fractofusus spatial 
distributions could be similarly modelled on all three bedding planes, the best- 
fit model from each surface was fitted to the other two surfaces. Simulation 
envelopes and values of pq were used to evaluate the fit (Fig. 2b and Extended 
Data Table 3). 

The uniqueness of each taxon’s spatial distribution was assessed by fitting 
the best-fit models for high-abundance taxa (Fractofusus, Charniodiscus, 
Primocandelabrum) onto each other. Low-abundance taxa (charniids, Thectardis) 
were excluded from this comparison because they yielded large simulation envel- 
opes, and consequently very different models fitted within the generated envelopes. 
Size-classes. The Fractofusus size data from the H14 surface allowed investigation 
of interactions between Fractofusus specimens of different sizes. To determine 
whether there was a dependency between spatial distribution and specimen 
size, the mark correlation function was calculated'”*! and compared with the 
simulation envelope produced from 99 Monte Carlo simulations (Extended 
Data Fig. 3a). 

The most objective way to resolve the number and range of size-classes in a 
population is by fitting size-frequency distribution data (the natural log of the 
variables length, width and the bivariate case of length multiplied by width) to 
various models, followed by comparison of (logarithmically scaled) Bayesian 
information criterion values’, which we performed in R using the package 
MCLUST». The number of populations thus identified was then used to define 
the most appropriate size-classes. A Bayesian information criterion value differ- 
ence of >10 corresponds to a ‘decisive’ rejection of the hypothesis that two models 
are the same, whereas values <6 indicate only a weak rejection of the similarity of 
the models.” 

Once defined, the spatial distributions for each size-class were analysed 
using the techniques described in the ‘Model fitting’ section. Although it was 
necessary to set firm boundaries for each size-class, the populations were normally 


distributed and therefore overlapped. As a result, the largest individuals of the 
small population were grouped within the middle size-class, while some of the 
smallest of the medium population were included within the small size-class. 
Using this information, further analyses were performed to consider the spatial 
relationships between individual size-classes. The PCF between each size-class 
was determined, the best-fit shared parent and LCMs were fitted (Extended Data 
Table 5 and Extended Data Fig. 4c, d) and the fit of each model assessed. Shared 
parent models describe the pattern when two clustered size-classes both cluster 
around an (un-defined) point or area, such as the effect of two different hetero- 
geneous soil effects on tree growth*’. LCMs arise when one size-class clusters 
around another size-class, for example when tree saplings cluster around their 
parent”. The procedure behind model fitting was similar to the single group/size- 
class (univariate) case; however, instead of single groupings, two size-class 
(bivariate) PCFs were used instead, as follows. (1) The best-fit Thomas cluster 
processes were found for each of the size-classes (shared parent and LCM). 
(2) The single size-class parameters for each size were input to the model classes 
(both models for the shared parent model and only the parent model for LCM). 
(3) The best-fit model was fitted to the PCF and L-functions. (4) If the model fit 
was poor (errors>0.025), the model was fitted using the PCF and then the 
L-function. If neither were a good fit, then the spatial scale that the model was 
fitted to was reduced, so that a good model could be found for at least part of the 
spatial scale. (5) The model was checked using 99 Monte Carlo simulations, pg 
values, and by comparing the univariate parameters with the bivariate model 
parameters. 
Isotropy analysis. To assess whether non-random behaviour was stronger in 
any particular direction (that is, it exhibited isotropy), density plots of the 
K-measure’** were used to calculate the normalized density in each direction 
around each point (Fig. 3 and Extended Data Fig. 3c), where normalized values 
of 1 indicate random distribution (homogeneous Poisson process) compared with 
<1 (segregation) and >1 (aggregation). The plots were produced by calculating 
the average of all the vectors that joined all pairs of points over different realiza- 
tions of the point process. Each point in turn was positioned on the plot centre, 
then a vector drawn to every point. The resulting vector scatter plot (also known as 
a Fry plot) was then smoothed using a Gaussian smoothing kernel”. 
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Extended Data Figure 1 | Map and simplified stratigraphic column 
showing the position of studied bedding planes with bedding plane maps 
of Fractofusus. a, Newfoundland, eastern Canada. Dashed area indicates 
region of interest in b. b, The Avalon and Bonavista Peninsulas, eastern 
Newfoundland. Locations of the bedding planes are indicated. c, Stratigraphic 
column (not to scale) compiled of information from the Avalon and Bonavista 
Peninsulas; lithological units in each region are treated as correlative in 
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this study, but work is continuing to determine the validity of this assumption. 
The ‘F’ surface at Mistaken Point has been dated to 565 + 3 Ma (ref. 12). There 
are currently no available radiometric dates from the Bonavista Peninsula. 
d-f, Maps of Fractofusus positions on the ‘D’ surface (d), the ‘E’ surface (e) and 
the H14 surface (f). In e the largest specimens are in light blue, medium 
specimens in mid-blue and smallest specimens in dark blue. 
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Extended Data Figure 2 | Retrodeformation calculations on the Mistaken (R? = 0.75). ¢, Fractofusus PCF on the ‘E’ surface with (solid line) and without 
Point surfaces. a, b, Plots of the lengths versus widths of discs from the “D’ (dashed line) retrodeformation. The grey shaded area depicts the boundary of 


surface, Mistaken Point (a), and the ‘E’ surface, Mistaken Point (b). The 99 Monte Carlo simulations for the model which provided the best-fit model to 
gradient of the line defines the retrodeformation factor, which for the ‘D’ the retrodeformed data, which has a good fit on the non-retrodeformed data 
surface is 1.35 + 0.11 (R* = 0.92) and for the ‘E’ surface is 1.71 + 0.08 (pa = 0.60). 
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Extended Data Figure 3 | Size distribution analysis of Fractofusus for the 
H14 surface. a, Size-frequency distributions for Fractofusus, (n = 1,214); 
b, the results of the Bayesian information criterion*”°* (univariate data). 
Squares and triangles correspond to models assuming equal and unequal 
variance, respectively. High values of the Bayesian information criterion 
correspond to a good model fit, so the best-fit model is a three-component 
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equal variance model using log-normalized length data. c—e, Rose diagrams 
plotting the directional orientation of the different size-classes of Fractofusus on 
the H14 surface showing large size-class (<11.0 cm, n=350) (c), intermediate 
size-class (5.5-11.0 cm, n=310) (d) and small size-class (<5.5 cm, n=554) (e). 
The angles of the Fractofusus central axis are relative to north (0°). There is no 
strong orientation preference for any of the size-classes. 
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Extended Data Figure 4 | Distance measures for the size data from H14 
surface. For all plots, the x axis is the inter-point distance between organisms 
(in metres). a, Mark correlation function*, where 1 corresponds to a lack of 
correlation of size, such that Fractofusus size is independent and identically 
distributed. A value of <1 corresponds to a positive dependency (in contrast to 
PCF) and >1 corresponds to a negative dependency. Small Fractofusus on the 
H14 surface (<0.3 cm) are more likely to be found near each other than 
expected by random. b, The ‘E’ surface PCF (solid line) showing the model that 
fits the data best, a double Thomas cluster model (dotted line, pg = 0.56), and 


the simulation envelope for 99 Monte Carlo simulations (grey shaded area). 
c, d, PCF for the best-fit models for the bivariate size-classes of Fractofusus on 
the H14 surface showing LCMs for small with medium size-classes (pg = 0.74) 
(c) and LCMs for medium with large size-classes (pg = 0.66) (d). e, The PCF of 
the largest size-class of H14 (solid line), showing the CSR Monte 

Carlo simulation envelope in grey, with the ‘D’ surface PCF (dotted line, 

Pa = 0.56). f, Nearest neighbour distances (solid line, pg = 0.01) with 

CSR Monte Carlo simulation envelope in grey. 
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Extended Data Figure 5 | Artistic reconstruction of Fractofususon theH14 the medium specimens also has small specimens clustered around them. 
surface, Bonavista Peninsula. The bottom right features a large Fractofusus | The small specimens therefore form an independent double cluster pattern, 
around which there are five to eight medium specimens clustered. Each of namely clusters of clusters. Artwork by C.G.K. 
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Extended Data Table 1 | Best-fit univariate cluster models 


Model Fit Pa Values 
Mean Cluster on Cluster on Hetero- Homo- Error 
number mumiber homo- hetero- geneous geneous function 
2 100 in aa geneous geneous Poisson Poisson for best fit 
cluster Background Background Model Model model 
Taxon 

Fractofusus (D) 4.312 0.638 3 338 0.77 0.21 0.51 0.01 0.00346 
Charniodiscus (E) 2.133 1.667 10 16 0.41 0.14 0.13 0.01 0.02438 
Charniid (E) 6.841 0.11 14 ri 0.60 0.11 0.90 0.01 0.00697 
Primocandelabrum (E) 5.616 0.329 11 19 0.28 0.18 0.09 0.01 0.00057 
Fractofusus (E) 11.842 0.39 25 43 0.32 0.35 0.03 0.01 0.00064 
Thectardis (E) 1.835 2.003 4 20 0.91 0.07 0.44 0.02 0.00290 
Fractofusus (H14) 7.906 0.192 2 728 0.25 0.01 0.03 0.01 0.00276 
Small size (H14) 3.291 0.203 ri 771 0.17 0.02 0.01 0.01 0.00021 
Medium size (H14) 4.377 0.188 4 714 0.86 0.15 0.01 0.01 0.01151 
Large size (H14) NA NA NA NA NA NA NA 0.30 NA 
lvesheadiomorphs (E) NA NA NA NA NA NA NA 0.21 NA 
Lobate Discs (E) NA NA NA NA NA NA NA 0.11 NA 
Bradgatia (E) NA NA NA NA NA NA NA 0.44 NA 
Pectinifrons (D) NA NA NA NA NA NA NA 0.28 NA 
Bradgatia (D) NA NA NA NA NA NA NA 0.51 NA 
lvesheadiomorphs (D) NA NA NA NA NA NA NA 0.13 NA 
Charniid (D) NA NA NA NA NA NA NA 0.10 NA 


For the heterogeneous backgrounds, the moving window radius is 0.5 m using the same taxon density as the taxon being modelled. A value of py = 1 corresponds to a perfect fit of the model on the data, 
while pa = 0 corresponds to no fit. The error function of the best-fit model gives the fraction of the total sum of squares for the transformed empirical PCF that is not explained by the model. 
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Extended Data Table 2 | Best-fit univariate double cluster models 


Large Scale Clusters Small Scale Clusters 
Mean Mean Error 
Number function 
Taxon NN 6 100 Bumper of o: 100 Humber Pa on 
(m) 1 P1 of small 2 P2 in value 
clusters Model 
clusters cluster : 
Fit 
Charniodiscus (E) 0.23 11.015 0.395 22 23 «4.184 = 0.203 8 0.63 0.00031 
Fractofusus (E) 0.17 12.191 0.392 12 23 «43.675 0.470 3 0.56 0.00147 
Primocandelabrum (E) 0.27 7.123 0.282 14 16 2.109 4.698 4 0.72 0.00036 
Fractofusus (H14) 0.12 11.859 0.166 6 24 =3.953 1.002 8 0.89 0.00063 
Small size class (H14) 0.13 9.873 0.078 6 19 1.646 0.808 5 0.72 0.00021 


Large-scale clusters are determined for the univariate cluster then input into the model, and the small-scale clusters are determined in the double cluster analysis. A value of py = 1 corresponds to a perfect 
fit of the model on the data, while pg = O corresponds to no fit. NN denotes the mean nearest neighbour distance. 
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Extended Data Table 3 | Best-fit double Thomas cluster models fitted onto other taxa 


Best-fit Model p, value 


Taxon Fractofusus  Fractofusus _ Fractofusus Charniodiscus _Primocandelabrum 


Surface H14 D E E E 
Taxon Surface 
Fractofusus H14 0.89 0.01 0.06 NA NA 
Fractofusus D 0.01 0.77 0.01 NA NA 
Fractofusus E 0.51 0.01 0.56 0.16 0.07 
Charniodiscus E NA NA 0.02 0.68 0.01 
Primocandelabrum E NA NA 0.51 0.01 0.72 


A value of pg = 1 corresponds to a perfect fit of the model on the data, while pq = O corresponds to no fit. Note that while these numbers may seem low (such as the ‘E’ surface fit), they need to be considered in 
context of the PCF graph (Extended Data Fig. 4b), which clearly shows a good fit to the data, with the small fluctuations of the observed PCF around the model PCF. 
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Extended Data Table 4 | Best-fit univariate cluster models on heterogeneous backgrounds for ‘E’ surface taxa 
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Taxon 


Charniodiscus 


Charniodiscus 


Fractofusus 


Fractofusus 


Primocandelabrum 


Primocandelabrum 


Charniid 


Fitted Model Parameters 


Heterogeneous Cluster Models p, values 


Spatial 
Scale 


Ss 


Random 


11.015 


2.856 


22.556 


3.662 


7.123 


2.090 
NA 


100p 


0.395 
2.880 
0.114 
2.169 
0.282 


1.057 
NA 


Mean 
number 
in 
Cluster 


22 


14 


4 
NA 


Number 
of 
Clusters 


23 


NA 


0.09 


0.08 


0.01 


0.16 


0.05 
0.60 


CHai 


0.45 


0.20 
0.02 


CHive 


0.14 
0.03 
0.03 
0.01 
0.41 


0.18 
0.04 


CHerac 


0.16 
0.05 
0.13 
0.01 
0.42 


0.17 
0.08 


CHcha 


0.14 


0.16 
0.03 


CHperimo 


0.04 
0.14 
0.01 
0.01 
0.18 


0.07 
0.04 


Hera 


NA 
NA 
NA 
NA 
NA 


NA 


0.18 


H Lob 


NA 
NA 
NA 
NA 
NA 


NA 


0.08 


Hthe 


NA 
NA 
NA 
NA 
NA 


NA 


0.12 


Hchar 


NA 
NA 
NA 
NA 
NA 


NA 


0.90 


Univariate clusters, either fitted to the small scale (S) or large scale (L) were modelled on different backgrounds defined by the density map of all taxonomic groups, or random for charniids. C, Thomas cluster on 
homogeneous backgrounds; CH, Thomas cluster on heterogeneous backgrounds. For the heterogeneous backgrounds, the moving window radius is 0.5 m since that radius produced the best fit for charniids: 
heterogeneous cluster model on a background density constructed from all species (CHa), ivesheadiomorphs (CHiye), Fractofusus (CHr;ac), Charniodiscus (CHcna) and Primocandelabrum (CHprimo). CSR on 

heterogeneous background (H), Bradgatia (Hera), lobate discs (Hob), Thectardis (Hthe) and charniids (Hchar). A value of pa = 1 corresponds toa perfect fit of the model on the data, while pg = 0 corresponds to no fit. 
The H14 surface did not possess enough non-Fractofusus specimens to perform similar analyses. NA, not applicable. 
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Extended Data Table 5 | Models for bivariate analysis between different size-classes of Fractofusus on the H14 surface 


Size Size Model Number Number Error 
class class type O1 100p; 2 100p2 Ot Obest 100pbest in of Pa on Fit 
1 2 yP Cluster Clusters 

Small Medium SP 2.829 0.3599 5.714 0.2359 451 7.857 0.1372 16 896 0.03 0.00336 
Small Medium LCM 2.829 0.3599 5.913 0.1027 NA 5.714 = 0.2359 11 23 0.74 0.00172 
Medium Large SP 5.913 0.1027 2.836 0.4054 3.7 2.428 1.0128 5 1018 0.01 0.01087 
Medium Large LCM 5.913 0.1027 2.836 0.4054 NA 1.512 0.1602 5 81 0.66 0.00771 
Small Large SP 2.829 0.3599 2.836 0.4054 283 11.42 0.2337 23 1366 0.03 0.00179 
Small Large LCM 2.829 0.3599 2.836 0.4054 NA 28.15 0.1844 56 700 0.01 0.00386 


A value of pg = 1 corresponds to a perfect fit of the model on the data, while py = 0 corresponds to no fit. The large size-class was randomly distributed, but was approximated by a cluster model, which was 
required for input into Programita®. 
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Tropical mountains are hot spots of biodiversity and endemism’”’, 
but the evolutionary origins of their unique biotas are poorly 
understood*. In varying degrees, local and regional extinction, 
long-distance colonization, and local recruitment may all contrib- 
ute to the exceptional character of these communities’. Also, it is 
debated whether mountain endemics mostly originate from local 
lowland taxa, or from lineages that reach the mountain by long- 
range dispersal from cool localities elsewhere®. Here we investigate 
the evolutionary routes to endemism by sampling an entire trop- 
ical mountain biota on the 4,095-metre-high Mount Kinabalu in 
Sabah, East Malaysia. We discover that most of its unique biodi- 
versity is younger than the mountain itself (6 million years), and 
comprises a mix of immigrant pre-adapted lineages and descen- 
dants from local lowland ancestors, although substantial shifts 
from lower to higher vegetation zones in this latter group were 
rare. These insights could improve forecasts of the likelihood of 
extinction and ‘evolutionary rescue” in montane biodiversity hot 
spots under climate change scenarios. 

In mountainous areas of the humid tropics, steep environmental 
gradients coincide with high primary productivity and relative cli- 
matic stability to sustain large numbers of species, often with strik- 
ing degrees of endemism at higher elevations’. It has therefore been 
recognized that tropical mountains are biodiversity hot spots of great 
conservation value’, especially because endemics on mountain tops 
are vulnerable to becoming trapped and then annihilated as a result 
of global warming”’. 

The evolutionary origins of these unique biotas, however, are poorly 
understood’. Like other insular habitats’, the endemic biota of an 
isolated mountain results from complex dynamics among coloniza- 
tion, in situ speciation, and local extinction. Each of these factors is 
dependent on the age and size of the habitat, and on the environmental 
contrast between the insular habitat and its matrix’. In the case of 
a tropical mountain top, an added complication is the fact that 
climate fluctuations may have widened and restricted the geographic 
range over which the montane conditions have extended in the past, 


meaning that parts of the endemic biota may be relicts, and other 
components may be novel in character*”. 

Disentangling these possibilities for a single tropical montane 
biodiversity hot spot requires molecular phylogenetic study of a large 
number of fauna and flora elements. However, with only few excep- 
tions'’”’, evolutionary studies in such hot spots have been limited 
to single taxa’*"*. This precludes broad understanding of the evolu- 
tionary and biogeographic origins of an endemic biota as a whole’. 

We investigated the evolutionary routes to endemism by sampling 
an entire tropical mountain biota on the UNESCO World Heritage site 
of Gunung Kinabalu in Sabah, East Malaysia. We demonstrate that 
most of its unique biodiversity is younger than the mountain itself and 
comprises a mix of immigrant pre-adapted lineages as well as descen- 
dants from local lowland ancestors. 

At 4,095 m, Kinabalu is the tallest mountain between the Himalayas 
and New Guinea. It isa solitary ‘sky island’, having emerged during the 
Pliocene and early Pleistocene as a granite pluton within the surround- 
ing sandstone of the Crocker Range, the latter having formed much 
earlier, between the Eocene and the early Miocene’’. Because of the 
area’s tectonic activity, as well as Pleistocene sea level changes, the 
exact historical progression of its elevation above sea level is not 
known, but it is likely that a major rise, even beyond today’s elevation, 
of Kinabalu, as well as the central spine of the Crocker Range, took 
place between 6 million years ago and today (for more geological 
background see Methods). Since the early days of its exploration’®, 
Kinabalu has been famous for its extremely high biological diversity, 
especially its richness in endemic species, with endemism proportions 
reaching 25-30% for some taxa’’. 

To unravel the origins of the exceptionally rich Kinabalu biota, we 
mounted a Malaysian-Dutch expedition in which 47 taxonomists 
worked at 37 localities, spanning the full range of elevations (Fig. 1). 
We used Sanger sequencing to sequence one or more fast-evolving loci 
for 1,852 individuals, belonging to 18 genera representing gastropods, 
annelids, insects, arachnids, vertebrates, pteridophytes, bryophytes, 
and angiosperms. We also obtained 3.7 million basidiomycete 
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Figure 1 | Map of the study area. Inset left, location of the study area in the 
World and in Borneo. Inset middle, detail of the summit trail in Kinabalu Park. 
The eight expedition stations in Kinabalu Park and Crocker Range Park are 


and glomeromycete ITS2 rDNA sequences from soil cores with ion 
semiconductor sequencing. In addition, we retrieved data from eight 
previously published single-taxon studies on vascular plants. 

We analysed all data within a phylogenetic framework to estimate 
the times of origin of endemic species, and to determine whether 
endemic species had descended from local or distant congeners 
(Methods, Fig. 2, Extended Data Table 1 and Extended Data Figs 1-9). 
Note that we define ‘endemic’ as restricted to the area in which 
our expedition took place. Although the present study offers the 
most comprehensive evolutionary analysis of any mountain biota to 
date, the taxa covered are, by necessity, an uneven and fragmentary 
sampling of the full diversity. Nonetheless, we expect that our 
results are representative for the Kinabalu biota as a whole, as our 
selected taxa encompass organisms with a wide variety of phylogen- 
etic backgrounds, ecologies, and life history traits. 

Similar to Mesoamerican endemic cloud forest seed plants and 
vertebrates'’, our molecular dating results show that the estimated 
mean stem-node ages of 33 endemic species span a wide range, from 
0.12 million years to 14.6 million years (Fig. 3). However, 76% of these 
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Figure 2 | Examples of phylogenetic reconstructions. a, Centric endemism 
(Ptomaphaginus). b, Eccentric endemism (Impatiens). c, Centric endemism 
superimposed on eccentric endemism (detail of the Cortinarius phylogeny). 
Clades are coloured with the same colour scheme as in Fig. 1, by elevational 
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indicated with red markers, ten additional sampling sites with blue markers. 
Not indicated separately are 15 sites along the summit trail, and four sites very 
close to Mahua, Gunung Alab, and Inobong. 


fall within the past 6 million years, the time span during which 
Kinabalu is likely to have reached its present elevation. Only two 
endemics, the frog Kalophrynus baluensis and the flowering plant 
Ilex kinabaluensis, are markedly older than the mountain itself. 
These may be explained as artefactual if we failed to identify the 
closest non-endemic sister lineage, thereby inflating their recon- 
structed age, or if these species are actually not endemics, but more 
widespread. Alternatively, they may truly be old endemics that evolved 
during cooler periods at lower elevations in Borneo before Kinabalu’s 
formation. 

Our phylogenetic and biogeographic analyses (Extended Data Table 1 
and Extended Data Figs 1-4, 7-9) suggest the existence of two categories 
of endemics'*: ‘eccentric’ (12 taxa) and ‘centric’ (25 taxa). The eccentric 
type of endemic has sister taxa that occur either in temperate climates 
(seven cases) or in other tropical mountains outside of Borneo (five 
cases). To this group belong all bryophytes, pteridophytes, some of the 
fungal lineages and also the endemics in the flowering plant groups 
Hedyotis, Ilex, Impatiens, Ranunculus and Euphrasia, and the animals 
Coeliccia and Tritetrabdella. Eccentric endemics predominantly occur at 
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zone at which they occur (see legend). High-elevation endemics in a and b are 
indicated in red. For support values and other details on these and other trees, 
see Extended Data Figs 1-4. 
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Figure 3 | Elevations and ages for endemic species. Elevations (mid-points, 
minima and maxima) and dates of origination derived from molecular 
dating (averages and 95% credible intervals) for endemic species; eccentric 
species (see main text) are indicated with a black circle. The vertical dashed line 
indicates the oldest possible date for Kinabalu to have reached its current 
elevation. For details, see Extended Data Figs 5 and 6. 


high elevations (mean lower elevational boundary, 2,212 m; s.d., 837 m), 
they are strict Kinabalu endemics (they do not occur on nearby, lower 
mountains), and are further characterized by high dispersal capacities 
(one, two and seven clades with eccentric endemism have small, medium 
and large dispersal, respectively). 

The centric type of endemic has sister taxa that occur locally, in the 
Borneo lowlands. Six out of the eight endemic animal species, some of 
the fungal lineages, the conifers, and 17 of the 20 flowering plants 
belong to this type. Centric endemics occur on average at lower eleva- 
tions (mean lower elevational boundary, 1,724 m; s.d., 728m) and 
tend to have lower dispersal capacities than the eccentric endemics 
(eight, seven and ten clades with centric endemism have small, med- 
ium and large dispersal, respectively). Some centric endemics are 
not strict Kinabalu endemics, as they are also found on other moun- 
tains in Kinabalu’s vicinity. Our ancestral state reconstructions 
(Supplementary Table 6) confirm a pattern of altitudinal bottom-up 
cladogenesis in the centric endemics, with 8 out of 11 species ranging 
partially or entirely outside of the 95% credibility interval for the 
reconstructed elevation of the most recent common ancestor, and 
the remaining 3 out of 11 falling entirely within this interval. When 
we performed the same analysis with Kitayama’s (ref. 19) seven vegeta- 
tion zones, rather than elevation, we found that only three of these 
eight elevation shifts may represent a shift towards a higher vegetation 
zone. This suggests that, even in high-elevation centric endemics, sub- 
stantial niche shifts are rare. Niche conservatism is probably even 
underestimated, since the Massenerhebung effect causes an upward 
shift of vegetation zones on Kinabalu compared with other Bornean 
mountains'*”°, 

Our multi-taxon study shows that Kinabalu’s biodiversity hot spot 
is of recent origin. This means the mountain is an evolutionary 
cradle, accumulating neo-endemics, as has been suggested for other 
young, high-elevation biodiversity hot spots at low latitudes*', such 
as the Tibetan plateau’, the Andean highlands’, and Afrotropical 
volcanoes”*. However, probably as a consequence of the rapid emer- 
gence of the mountain’ and its unique alpine summit conditions”, 
many of these neo-endemics have not evolved by drastic niche 
shifts from local ancestors, but rather by immigration of pre-adapted 
propagules from elsewhere. This explains the multiple independent 
colonization events in some taxa (for example, Glomus, Rhododendron 
and Coeliccia). In addition, local lowland taxa have also generated 
montane species, but although some of these have reached vegetation 
zones above 2,000 m, most do not show substantial niche shifts away 
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from their ancestral niche. The fact that the endemic biota of Kinabalu 
appears to be composed largely of pre-adapted (eccentric) species and 
locally derived (centric), ecologically conserved endemics is in line 
with niche conservatism”. 

We suggest that our novel approach of molecular dating of multiple 
clades be applied to larger communities in this and other tropical 
montane biodiversity hot spots”. In combination, such information 
should allow a detailed dissection of the relative roles of ecological 
speciation, colonization and habitat filtering in the formation of 
endemic biotas in this and other tropical mountains. Moreover, such 
understanding could improve predictions of the likelihood of extinc- 
tion and evolutionary rescue of endemic species experiencing chan- 
ging climate conditions’. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during 
experiments and outcome assessment. 

Expedition. Material was sampled chiefly between September 10th and 25th, 
2012, by the participants of the Crocker Range/Kinabalu Scientific Expedition, 
which was conceived, organized, funded and conducted jointly by Sabah Parks 
(Malaysia) and Naturalis Biodiversity Center (The Netherlands). The expedition 
was set up with the explicit aim to investigate evolutionary patterns of endemism 
(see main text), and participants were selected to maximize taxon coverage and 
ecological diversity. The 47 expedition members were asked to identify one or 
more congeneric species complexes (hereafter termed ‘genera’) within their taxon 
of expertise that included high elevation endemics and preferably also more wide- 
spread lowland species (although there is a possibility that this approach would 
leave out any monotypic endemics, in the groups studied by us, such species are 
not known). These, plus outgroup species, were then sampled (using taxon- 
specific methods; see Supplementary Table 2 for further details) at eight expedition 
stations located at a range of elevations (Fig. 1), and also in additional locations in 
both parks and their vicinity. Data on time, latitude and longitude, elevation, and 
habitat were collected with each sample. Material was subsampled for DNA ana- 
lysis, mostly in the field, by preserving tissues or entire organisms, usually either 
in pure ethanol or in silica gel (see Supplementary Table 2 for taxon-specific 
subsampling and storage). In the case of fungi, mycelial DNA was obtained by 
bulk extraction from composite soil cores (see Supplementary Table 2 and below). 
In addition to the data collected during the expedition, the expedition members 
were asked to provide a complete assessment of elevational and geographical 
distribution for each of the species within their taxon of expertise, based on 
published sources and previous collections. 

Molecular analysis. Upon arrival in the Naturalis DNA barcoding facility, sub- 
samples were processed in a semi-automated way. DNA extractions on animal 
material were performed on either tissue samples or individual arthropod legs, or 
using whole arthropods in a non-destructive extraction method. DNA was 
extracted using the NucleoMag 96 Tissue kit (Macherey-Nagel Gmbh & Co., 
Diiren, Germany) on the KingFisher Flex magnetic particle processor (Thermo 
Scientific). DNA extractions on plant tissue were performed on dried leaf samples, 
using the NucleoMag 96 Plant kit (Macherey-Nagel Gmbh & Co., Diiren, 
Germany). The KingFisher Flex protocols were slightly adjusted from the manu- 
facturer’s protocol (http://www.thermoscientific.com) to include longer mixing of 
beads with lysed sample and a drying step before elution. A volume of 150 jl was 
used for elution in both animal and plant samples. Soil samples containing fungal 
mycelia were processed by pooling 40 cores (for sampling details, see 
Supplementary Table 2), resulting in a composite soil sample for each site. 
Genomic DNA was extracted from 1 g of this sample using the NucleoSpin Soil 
kit (Macherey-Nagel Gmbh & Co., Diiren, Germany), according to the manufac- 
turer’s protocol. For each sample DNA extraction was carried out twice and 
replicates were combined. 

For each specimen, the accepted barcode marker for its taxon was sequenced, in 
many cases augmented with one or more other phylogenetically informative loci. 
Supplementary Tables 1 and 2 provide lists of the loci and primers used for each 
genus included in the study. Where relevant for outgroup comparison or further 
geographic resolution, we then obtained additional sequences from Borneo or 
elsewhere from GenBank or from our own, unrelated studies. GenBank accession 
codes for these are given in Supplementary Table 2. After sequencing, (remains of) 
voucher specimens were mostly deposited in the collections of Sabah Parks and/or 
Naturalis Biodiversity Center. Voucher collection codes are listed in Supple- 
mentary Table 2. 

PCR reactions generally contained 18.75 yl ultrapure HO, 2.5 ul 10 PCR 
buffer, 1.0 1 10mM of each primer, 0.5 pl 2.5 mM dNTPs and 0.25 ul 5U pl! 
Qiagen Taq. Most PCRs were performed using an initial denaturation step of 180 s 
at 94°C, followed by 40 cycles of 15 s at 94°C, 30 s at 50°C and 40 s at 72°C, 
and ended with a final extension of 300 s at 72°C and pause at 12°C. 
Bi-directional Sanger sequencing was performed at either Macrogen (http:// 
www.macrogen.com) or BaseClear (http://www.baseclear.com). Sequences were 
edited manually (primers trimmed off and obvious reading errors corrected) using 
Sequencher 4.10.1 (Gene Codes Corporation). For the fungal soil samples, the 
amplicon library was sequenced using an Ion 318'™ Chip in an Ion Torrent 
Personal Genome Machine (PGM; Life Technologies, Guilford, CT, USA). PCR, 
sequencing, raw data filtering and all bioinformatic analyses were performed as 
described in detail previously’®. From these fungal soil sequences, we selected non- 
redundant sequences of the ectomycorrhizal genus Cortinarius and the arbuscular 
mycorrhizal genus Glomus because they were distributed along the entire altitu- 
dinal range of our sampling. 
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Phylogenetic analysis. We performed initial exploratory alignment and created 
neighbour-joining trees. This allowed an assessment of individuals that (on the 
basis of cursory morphological examination) had been included as potential 
(undescribed) members of the species complexes of choice. In some cases, the 
sequence analysis showed that these individuals were in fact unrelated and such 
sequences were then removed before further analysis. Then, using Geneious (ver- 
sion 6.0.5; created by Biomatters; available from http://www.geneious.com/) as a 
workbench, sequence data were aligned with MUSCLE”. We ran ModelTest” to 
determine, using Akaike Information Criterion (AIC), the best model of nucleo- 
tide substitution per alignment, for each genetic marker separately. We then 
performed a Bayesian analysis using MrBayes 3.2.2 (ref. 28) with settings for 
multiple markers unlinked, running two replicates of a minimum of 10’ genera- 
tions each. Convergence was checked in Tracer (v1.5; ref. 29) and, after discarding 
a 25% burn-in, the resulting majority-rule consensus trees were preserved. For the 
fungal data sets, multiple sequence alignments were built using MUSCLE and, due 
to the large size of these data sets, phylogenetic analyses of maximum likelihood 
(ML) were carried out in Garli 2.0 (ref. 30) using the GTR+I+G model with 
estimated values of the alpha parameter of the gamma distribution of variable 
sites (G) and proportion of invariable sites (I). Clade support was assessed by non- 
parametric bootstrap analyses with 100 pseudo-replicates. 

Molecular dating. The times of divergence of Mount Kinabalu endemics from 
their most closely related extant relatives were estimated with a molecular clock 
approach. We included not only our own data, but also incorporated published 
studies on genera other than the ones we sampled. For each genus, taxon sampling 
and molecular data used are specified in Supplementary Table 4. All data sets 
were assembled in Geneious (ver. 5.5.6; created by Biomatters; available from 
http://www.geneious.com/). Data sets were pruned to contain a single taxon per 
species by selecting a random sequence from each species. Sequence alignments 
were obtained with the MAFFT (ver. 6.814b) alignment tool implemented in 
Geneious. Divergence time estimates were obtained using a relaxed molecular 
clock approach with BEAST (ver. 2.1.3; ref. 31). For each DNA region used, an 
unlinked substitution model was implemented as selected with jModeltest (ver. 
2.1.5; ref. 27) under the AIC. For the tree prior, a Yule birth rate prior was selected 
with a flat prior on the Yule birth rate. A single clock and tree model was used for 
each analysis. All analyses were run with an uncorrelated log-normal clock model, 
to which an exponential prior was assigned for both the mean (ucldMEAN; mean 
set at 10.0) and the standard deviation (ucldStdev; mean set at 0.33). Time cal- 
ibration was done using: (1) fossil constraints; (2) secondary calibration points; or 
(3) substitution rates, depending on the information available for each particular 
genus. In cases where suitable fossil or secondary calibration information was not 
available, we used a priori information about the substitution rate of one of the 
gene partitions—as opposed to the ‘fix the mean substitution rate’ option across all 
partitions—to optimize the analyses. The full details on calibration methods used 
for each taxon are given in Supplementary Table 5. For each run, posterior dis- 
tributions of parameters were approximated using a Markov chain Monte Carlo 
analysis of 5 X 10” generations, sampling every 5,000 generations, discarding a 
burn-in of 5 X 10° generations (10%). Convergence of the chains was checked by 
evaluating the ESS values of each parameter with Tracer v1.6. 

Biogeographical, elevational and vegetation zone analyses. For each genus, an 
independent estimation of the dispersal neighbourhood size was obtained from 
the spatial ecology literature on the genus concerned or related genera, and/or 
expert knowledge. We used this information to classify dispersal neighbourhood 
(two standard deviations for the distribution of distances between birth and 
reproduction for individuals of a certain species) in the genus in question as ‘small’, 
<1km (mostly flightless animals), ‘medium’, 1-100km (flying or ballooning 
animals, and plants dispersed by flying animals), or ‘large’, >100km (plants 
and fungi with very small wind-dispersed seeds or spores). An overview of 
dispersal assessments is in Supplementary Table 3. 

Ancestral ranges were estimated for each taxon group under maximum like- 
lihood with BioGeoBears” using the three-parameter DEC+J model. The ultra- 
metric tree from the Bayesian relaxed molecular clock analysis was used as input 
tree. This tree was pruned to include the clade with the endemic species and at least 
one deeper node (for most groups two deeper nodes were included). Each species 
was scored to be absent or present in Mount Kinabalu (K), Borneo (B), and the rest 
of the world (W). For each group the analysis was run with equal probability for 
exchange between the three areas over the entire diversification time of the tree. 
The maximum number of areas was assigned as the maximum number of areas 
observed in the extant taxa (either 2 or 3). The optimizations, with the ancestral 
range with the highest likelihood for each node, are shown in Extended Data 
Figs 7-9). We classify a “W’ to ‘K’ or ‘KW’ shift in the lineage leading to a species 
endemic to Kinabalu as ‘eccentric endemism’. All other ancestral ranges leading 
to Kinabalu endemics are scored as ‘centric’. 
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For each genus sampled during the expedition that showed centric endemism, 
we added the elevation at which it had been collected as a continuous character. In 
BayesTraits 2.0 (ref. 33) we reconstructed ancestral states for elevation at the node 
leading to the endemic species and its sister clade using the continuous random 
walk model which implements a generalized least squares model for continuously 
varying traits. The elevation of the most-recent common ancestor of the node of 
interest was inferred using a MCMC approach on a set of 2,700 trees randomly 
sampled after the phylogenetic inference with MrBayes (excluding burn-in). The 
MCMC chain of BayesTraits was run for 10° generations, discarding the results 
sampled during the first 10° generations as burn-in. We then inspected whether 
the current range of elevation of the endemic species stretches beyond the 95% 
credible interval for the most-recent common ancestor. If it stretches above this 
interval, we considered this indicative of a ‘bottom-up’ scenario, whereas the 
reverse would suggest a ‘top-down’ scenario. The results are summarized in 
Supplementary Table 6. 

In parallel with the elevational analysis, we conducted an analysis in which we 
coded the vegetation belt(s) as taxon traits. We used the zones defined for Mt. 
Kinabalu in ref. 19, namely: (i) lowland vegetation (below 600 m); (ii) hill vegeta- 
tion (600-1,200 m); (iii) lower montane vegetation (1,200-2,000 m); (iv) upper 
montane vegetation (2,000-2,800 m); (v) subalpine vegetation (2,800-3,400 m); 
(vi) upper subalpine vegetation (3,400-3,700 m); (vii) alpine vegetation (above 
3,700 m). The probabilities at each vegetation belt for the most-recent common 
ancestor of the node leading to the endemic clade and its sister group were inferred 
with the continuous-time Markov model for discrete data implemented in 
BayesTraits 2.0. The same input trees were used as for the elevation reconstruction, 
and the MCMC was run for 10° generations, discarding the results sampled during 
the first 10° generations as burn-in. The resulting probabilities are shown in 
Supplementary Table 7. 

Geological summary. The following is based on refs 15,34 and R. Hall, personal 
communication, 2014. 

Orogeny in the region began at about 23 million years ago (Mya) as a result of 
the northward movement of the Australian plate leading to the collision of 
Australia and Southeast Asia in what today is eastern Indonesia. Orogeny in the 
Kinabalu area began at about 20 Mya after collision of north Borneo with the 
South China continental crust. This led to the emergence of most of Sabah. Soon 


after this emergence much of Sabah became submerged, although a north-south- 
oriented mountain belt formed of tightly folded mudstones and sandstones (the 
future Crocker Range) remained. At the same time, the climate changed from 
seasonal to ever-wet. Mapping of former shelf edges offshore shows that later the 
Crocker Range widened with time, implying that it became higher in its central 
parts. Between 8 and 7 Mya a granitic body intruded the crust and cooled rapidly 
from ~7 to 5 Mya. Melting could have resulted from a build-up of heat from 
radioactive decay in the deep crust, or from decompression melting following 
extension. After 6 Mya (the precise timing and duration is not known) the 
area (including the surrounding Crocker Range) rose further, elevating to the 
point that it was glaciated recently, possibly after a lithospheric root detached 
and sank into the deeper mantle, leading new mantle material to flow into the 
vacated space below the Crocker Range. Extensive glacial weathering of the sum- 
mit suggests the mountain had reached its present elevation by at least 1 Mya. 
The Pinosuk gravels on Kinabalu’s south-eastern slope are the accumulated ero- 
sional debris of young glaciation. Today, the mountain is still rising by 0.5 mm 
per year. 
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Aranea - Myrmarachne 
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Extended Data Figure 1 | Phylogenetic reconstructions for Anura, Araneae, Athyriaceae, Balsaminaceae and Begoniaceae. Details of materials and methods 
used are given in Supplementary Table 2. 
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Extended Data Figure 2 | Phylogenetic reconstructions for Coleoptera, Dicranaceae, Diptera and Cortinariaceae. Details of materials and methods used are 
given in Supplementary Table 2. 
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Extended Data Figure 3 | Phylogenetic reconstructions for Ericaceae, Gastropoda, Glomeraceae, Hemiptera and Hirudinea. Details of materials and 
methods used are given in Supplementary Table 2. 
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Extended Data Figure 4 | Phylogenetic reconstructions for Nepenthaceae, Odonata, Polygalaceae, Rubiaceae and Trombidiformes. Details of materials and 
methods used are given in Supplementary Table 2. 
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Athyrium Orchidaceae — Dendrochilum Hedyotis 
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Extended Data Figure 5 | Chronograms for Athyrium, Begonia, Coeliccia, _ for more details. Node colours represent posterior probabilities. Endemic 
Dacrycarpus, Epirixanthes, Dendrochilum, Dicranoloma, Impatiens, species printed in red. 
Hygrobates, Hedyotis, Ilex and Myrmarachne. See Supplementary Table 5 
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Euphrasia Paphiopedilum Rhododendron 
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Extended Data Figure 6 | Chronograms for Euphrasia, Kalophrynus, Weinmannia. See Supplementary Table 5 for more details. Node colours 
Nepenthes, Paphiopedilum, Philalanka, Ptomaphaginus, Payena, represent posterior probabilities. Endemic species printed in red. 
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Extended Data Figure 7 | Ancestral range estimations for Athyrium, Begonia, Coeliccia, Dacrycarpus, Dendrochilum, Dicranoloma, Epirixanthes, 
Euphrasia and Hedyotis. See Methods for further details. The trees are alphabetically ordered by genus name. 
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Extended Data Figure 8 | Ancestral range estimations for Hygrobates, Ilex, Impatiens, Kalophrynus, Myrmarachne, Nepenthes, Paphiopedilum, 
and Philalanka. See Methods for further details. The trees are alphabetically ordered by genus name. 
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BioGeoBEARS DEC+J on Weinmannia MO_unconstrained 
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Extended Data Figure 9 | Ancestral range estimations for Ptomaphaginus, Ranunculus, Rhododendron, Teleopsis, Trichosanthes, Tritetrabdella and 
Weinmannia. See Methods for further details. The trees are alphabetically ordered by genus name. 
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Extended Data Table 1 


Details for all studied endemic taxa 


Endemic taxon Origin’* Dispersal Region**** Elevation Mean divergence | 95% Cl Occurrence of sister 
capacity*** range (m) (mya) lineage 
Dicranoloma havilandii eccentric Large K 2400-3500 | 0.4 0-13 Australia 
Athyrium clemensiae eccentric K 2750-4000 | 10.7 3.8 - 19.2 China, Japan (800- 
2600 m) 
Athyrium amoenum eccentric K 2100-3500 | 2.4 On bey China, Japan (1500- 
3000 m) 
Dacrycarpus centric K 2100-4000 | 5.7 0.7-11.8 Tropical SE Asia, incl. 
kinabaluensis 1 Kinabalu (700-2700 
m) 
Dacrycarpus centric K 2100 — 4000 | 1.2 0-39 Tropical SE Asia, incl. 
kinabaluensis 2 Kinabalu (700-2700 
m) 
Nepenthes centric K,T 1600 — 3240 | 14.3 5.1-25.9 Tropical SE Asia, incl. 
edwardsiana-N. vilosa mountains on Borneo 
600-2650 m) 
Nepenthes rajah-N. x centric 1500 — 3030 | 10.5 3.1-19.1 Tropical SE Asia, incl. 
kinabaluensis mountains on Borneo 
(600-2600 m) 
Epirixanthes centric 900 — 1200 7.2 25-127 Tropical SE Asia, incl. 
kinabaluensis Kinabalu 
Trichosanthes centric 1000-1600 | 3.8 0.7-7.4 Borneo lowlands 
kinabaluensis 
llex kinabaluensis eccentric 2700 14.6 8.3 - 21.7 East Asia 
Impatiens kinabaluensis_| eccentric 800 - 1700 13 0.2-2.4 Vietnam 
Begonia vaccinoides-B. | centric 1500 — 2900 | 1.4 0.9—1.9 North Borneo (incl. 
burbidgei Kinabalu), montane 
forest 
Dendrochilum clade 1 centric 2100-3800 | 0.9 OPS 5) Borneo mountains 
(incl. Kinabalu) 
Dendrochilum clade 2 centric 1500 — 3200 | 1.3 OS S270 Borneo mountains 
incl. Kinabalu) 
Dendrochilum clade 3 centric Large 1500 - 3700 | 0.7 0.1-1.6 Borneo mountains (not 
Kinabalu) 
Dendrochilum clade 4 centric Large 1050-2000 | 0.5 On-13 Borneo mountains 
incl. Kinabalu 
Paphiopedilum centric Large K 500 — 1800 2.0 0.7-3.7 Borneo lowland 
rothschildianum 
Ranunculus lowii eccentric Large K 3000-4000 | 04 On 019 New Guinea 
Hedyotis macrostegia eccentric 2700 — 4000 | 1.0 0.3-1.7 Borneo (incl. Kinabalu) 
Euphrasia borneensis eccentric Large K 2400-4000 | 3.5 2.2-5.0 East Asia 
Payena kinabaluensis centric 800 — 1000 1.6 0.6-2.9 Borneo (lowland and 
lower montane forest) 
Rhododendron centric Medium K 2400-4000 | 2.9 0.0-4.0 North Borneo (incl. 
ericoides Kinabalu), lower and 
mid montane forest 
Rhododendron centric Medium K 2400 - 3600 | 4.3 0.8 -5.8 North Borneo (incl. 
buxifolium Kinabalu), montane 
forest 
Rhododendron centric Medium K 2400 — 3400 | 4.1 0.3-5.9 East Asia (incl. 
acuminatum Borneo), montane 
forest 
Weinmannia centric Large K 2400 — 2500 | 9.2 3.4- 16.6 Fiji and Malaysia (incl. 
clemensiae Borneo), lowland to 
lower montane forest 
Glomus clade 1 centric Large K 2850 — 3600 | n/a n/a Kinabalu and Crocker 
Range, <2850 m elev. 
Glomus clade 2 eccentric Large K 2850 — 3600 | n/a n/a Europe 
Cortinarius Clades 1, 3, eccentric Large K 2850 — 3600 | n/a n/a Holarctic, South 
4, 10, 11, 12, 14, 18, 20, Pacific 
21, 23, 24, 26, 27, 28, 
Sie 
Cortinarius clades 2, 5- centric Large K 2850 — 3600 | n/a n/a Kinabalu and Crocker 


9, 13, 15-17, 19, 22, 25, 


Plants are highlighted in green, fungi in blue, and animals in brown. 

* For clade numbering in Cortinarius, see Supplementary Table 2. 

** Centric/eccentric status determined via biogeographic analysis (see Extended Data 4). 
** For definitions and sources, see Supplementary Table 3. 

**** K Mount Kinabalu; T, Mount Tambuyukon; C, Crocker Range; TM, Mount Trus Madi. 


##** Manuscript name. 


©2015 Macmillan Publishers Limited. All rights reserved 


Range, <2850 m elev. 


Mae A lea 


doi:10.1038/nature14603 


A self-organized biomechanical network drives 
shape changes during tissue morphogenesis 


Akankshi Munjal', Jean-Marc Philippe’, Edwin Munro” & Thomas Lecuit! 


Tissue morphogenesis is orchestrated by cell shape changes. Forces 
required to power these changes are generated by non-muscle myo- 
sin II (MyolII) motor proteins pulling filamentous actin (F-actin). 
Actomyosin networks undergo cycles of assembly and disassembly 
(pulses)'” to cause cell deformations alternating with steps of sta- 
bilization to result in irreversible shape changes**. Although this 
ratchet-like behaviour operates in a variety of contexts, the under- 
lying mechanisms remain unclear. Here we investigate the role of 
MyollI regulation through the conserved Rhol-Rok pathway’ 
during Drosophila melanogaster germband extension. This mor- 
phogenetic process is powered by cell intercalation, which involves 
the shrinkage of junctions in the dorsal-ventral axis (vertical junc- 
tions) followed by junction extension in the anterior—posterior 
axis®. While polarized flows of medial-apical MyolI pulses deform 
vertical junctions, MyolI enrichment on these junctions (planar 
polarity) stabilizes them®’. We identify two critical properties of 
MyollI dynamics that underlie stability and pulsatility: exchange 
kinetics governed by phosphorylation-dephosphorylation cycles 
of the Myoll regulatory light chain; and advection due to contrac- 
tion of the motors on F-actin networks. Spatial control over MyolI 
exchange kinetics establishes two stable regimes of high and low 
dissociation rates, resulting in MyolII planar polarity. Pulsatility 
emerges at intermediate dissociation rates, enabling convergent 
advection of MyolI and its upstream regulators Rhol GTP, 
Rok and Myoll phosphatase. Notably, pulsatility is not an outcome 


i) 
[3 


io” WT RLC-GFP + water 
MBS-GFP 


P< 0.0005 w RokKD/Myoll (n = 172 junctions, 5 embryos) 


of an upstream Rhol pacemaker. Rather, it is a self-organized 
system that involves positive and negative biomechanical feedback 
between Myoll advection and dissociation rates. 

To understand the mechanisms that underlie pulsatility (shape 
deforming) and stability (shape stabilizing) of actomyosin networks 
during morphogenesis, we investigated the role of the Rhol pathway 
during germband extension (GBE)’. Rhol GTPase activates Rho kinase 
(Rok)’, which in turn phosphorylates the MyolI regulatory light chain 
(RLC), resulting in assembly of processive bipolar filaments that bind 
F-actin to generate contractility’. Rok also inhibits the myosin-binding 
subunit (MBS)"' of the MyollI phosphatase complex, responsible for 
dephosphorylating the RLC and thus inactivating the motor’. 
Previous studies, with the use of phosphomimetic RLC (T20E/ 
S21E)"*, have reported that MyolI activation during gastrulation®’*” 
and planar polarity’® requires Rok-dependent phosphorylation. 
Phosphomimetic RLC, by definition locked in an active state, results 
in reduced GBE", suggesting that dephosphorylation by phosphatase is 
essential. To test this, we inhibited the RLC-targeting subunit of the 
Myoll phosphatase complex (Mypt-75d (ref. 17)) using RNA interfer- 
ence (RNAi) (Extended Data Fig. 1a). RNAi inhibition of Mypt-75d 
resulted in slower GBE and junction shrinkage (Extended Data 
Fig. 1b, c). Green fluorescent protein (GFP)-tagged wild-type RLC 
revealed that the enrichment of MyolI on vertical versus transverse 
junctions (along the anterior—posterior axis) was 80% in controls, and 
reduced to 30% in Mypt-75d- and MBS-inhibited embryos (Fig. 1a, b 


Figure 1 | Spatial control over RLC phospho- 
cycles is required for MyolI polarity. a, Myoll 
distribution in water and Mypt-75d dsRNA- 
injected embryo. b, Average planar polarity of 
Myoll in different conditions. P values calculated 
with Mann-Whitney U-test. c, Cell shows ‘vertical’ 
and ‘transverse’ junctions. Graph illustrates 
FRAP. Plot shows the mobile fraction of RLC in 
different conditions. Values from each junction 
(open circles) and the average from all 

Bee) junctions (open rectangle) are shown. P values 
“ly. calculated with Mann-Whitney U-test. NS, not 
significant. d, Distribution of MBS-GFP and 
RLC-mCherry. e, Average enrichment of kinase- 
dead Rok (RokKD)-GFP or MBS-GFP relative 
to Myoll. P values calculated with Student’s 
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and Extended Data Fig. 1d, e). A similar distribution was observed for 
GFP-tagged phosphomimetic RLC. Phosphomimetic RLC-GFP is not 
viable in the presence of the null allele of Drosophila RLC"’, and was thus 
co-expressed with mCherry-tagged wild-type RLC (Extended Data Fig. 
2a). In these embryos, a pharmacological Rok inhibitor (40 mM 
Y27632) specifically decreased the recruitment of wild-type RLC, while 
phosphomimetic RLC remained at junctions (Extended Data Fig. 2a, b) 
and exhibited a dose-dependent reduction of planar polarity (Fig. 1b). 
Diminished planar polarity in these conditions was due to higher 
recruitment of MyolI at transverse junctions compared to controls 
(Extended Data Fig. 2e). Conversely, over expression of a constitutively 
active mutant of MBS that lacks the target site regulated by Rok 
(MBSN300)"', reduced Myoll levels at junctions (Extended Data Fig. 
2c-e) indicative of Rok-dependent MBS inhibition on vertical junctions. 
Thus, steady-state levels of MyolI at junctions depend on RLC 
phospho-cycling’* suggesting that its polarized distribution requires 
Rok-dependent recruitment and Myoll phosphatase-dependent 
dissociation. 

We tested this hypothesis by fluorescence recovery after photo- 
bleaching (FRAP). The mobile fraction from FRAP gives an estimate 
of the pool available for exchange (Fig. 1c). Junctional RLC exhibits 
lower turnover at vertical compared to transverse junctions” 
(Extended Data Fig. 3a, c and Fig. 1c). Expression of the constitutively 
active MBS resulted in higher mobile fraction of RLC on all junctions 
(Fig. 1c). Mobile fraction was reduced in transverse junctions in Mypt- 
75d RNAi embryos and was lower than the mobile fraction of vertical 
junctions in controls (Extended Data Fig. 3b, c and Fig. 1c). Similarly, 
the mobile fraction of phosphomimetic RLC became progressively 
lower when the endogenous wild-type RLC was reduced by inhibiting 
Rok (Y27632; Extended Data Fig. 3d). Reduced dephosphorylation 
resulted in higher tension at the transverse junctions as measured by 
focal laser ablation'®'*”°(Extended Data Fig. 3e-g). Together these 
results show that control over the proportion and cycling of phos- 
phorylated RLC determines the polarized stability of MyolI. 

We next determined the distribution of Myoll phosphatase at cell 
junctions. MBS tagged with GFP”? (MBS-GFP; Fig. 1d) showed 30% 
enrichment at vertical compared to transverse junctions (Extended 
Data Fig. 4a). We measured the enrichment of MBS-GFP relative to 
MyolI and compared it to the enrichment of Rok relative to Myoll 
using a GFP-tagged wild-type version” (overexpression; Extended 
Data Fig. 4b) and a kinase-dead version (K116A) which faithfully 
recapitulates Rok localization”’. Rok, like Myoll, is enriched on vertical 
junctions”, and the ratio of wild-type Rok:RLC (Extended Data 
Fig. 4c) and kinase-dead Rok:RLC (Fig. le) was 1 at all junctions, 
showing positive correlation. The ratio of MBS:RLC was greater than 
1 on transverse junctions and less than 1 on vertical junctions (Fig. le), 
indicative of an inverse correlation. Thus, the respective distributions 
of Rok and MBS regulate the stability and dissociation of MyolI at the 
junctions to establish planar polarity (Fig. 1). 

We next asked whether this biochemical network underlying 
polarity of Myoll also explains the temporal dynamics manifested 
in medial pulsatility. It was reported that bypassing upstream regu- 
lation of RLC abolishes Myoll pulsatility'’. However, how pulses 
emerge from the dynamics of MyollI and its upstream regulators 
is unclear. Actomyosin pulses consist of two events: assembly, char- 
acterized by local increase of apical F-actin (visualized by the actin 
binding domain of utrophin, UtrABD-GFP)* and MyolI mini- 
filaments concomitant with coalescence, followed by disassembly 
(Supplementary Video 1 and Fig. 2a). Importantly, MyollI pulses 
are dominated by recruitment rather than densification of mini- 
filaments (85%) (Fig. 2b, h), suggesting that the upstream regulators 
could likewise be pulsatile and entrain Myoll pulsatility. Notably, 
both Rok (wild-type and kinase-dead) and MBS were pulsatile with 
duration comparable to MyolI pulses and amplitude dominated by 
incorporation from the cytoplasm (75%) (Supplementary Video 2, 
Fig. 2c-i and Extended Data Fig. 4d). 
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Figure 2 | MyolI pulses with its regulators Rok, MBS and Rhol GTP. 

a, Medial F-actin densification, Myoll recruitment, coalescence and 
disassembly. b, An example of medial Myoll intensity and apical area time 
trace. c, e, Medial pool of kinase-dead Rok (RokKD)-GFP (c) and MBS-GFP 
(e) co-expressed with RLC::mCherry respectively. d, f, Corresponding 
medial intensity time traces. g, Average pulse amplitude. h, Average per cent 
contribution of recruitment to the pulse amplitude. i, Average duration of 
pulses. j, Biochemical pathway responsible for RLC activation (-P represents 
phosphorylation). k, Medial pool of AniRBD-GFP co-expressed with RLC- 
mCherry. 1, Corresponding medial intensity time traces. p, number of pulses; 
e, number of embryos. Scale bars, 1 jum. 


We postulated an upstream pacemaker hypothesis whereby the 
pulsatility of Rok and MBS is entrained by the molecular switching 
of their regulator Rhol GTPase between active-GTP and inactive- 
GDP forms (Fig. 2j). Indeed, Rhol is required for MyolI activation. 
Injection of the C3 exoenzyme, a Rhol GTP inhibitor”, significantly 
reduced Myoll apical recruitment (Extended Data Fig. 5a—c). Likewise, 
a constitutively active Rhol (RhoV14) led to precocious accumulation 
of MyolI with increased pulse amplitude and duration, and lower fre- 
quency (Supplementary Video 3 and Extended Data Fig. 5d—-h). To view 
the activity of Rho1, we constructed a sensor by tagging the Rhol GTP- 
binding domain of anillin’® (AniRBD) to GFP. AniRBD-GFP was pre- 
sent at apical junctions, exhibited 20% enrichment on vertical junctions 
(data not shown) and was pulsatile (Supplementary Video 4 and 
Fig. 2g-l). C3 exoenzyme injection led to a significant reduction of 
AniRBD-GFP at junctions and loss of medio-apical pulsatility 
(Extended Data Fig. 5a—c) confirming it to be a sensor of Rhol GTP. 

We tested the upstream pacemaker model by doing cross-correla- 
tion analysis between medial intensity time traces of RLC-mCherry 
and either kinase-dead Rok-GFP, MBS-GFP or AniRBD-GFP 
(Fig. 3a). The correlation was strong in all cases (Pearson’s correlation 
coefficients were 0.8 for Rok, 0.7 for MBS and 0.5 for AniRBD). While 
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the intensity peak of MyolII concurred with Rok and AniRBD, MBS 
peaks followed Myoll peaks with an average delay of 2 s (Fig. 3b). The 
temporal delay of MBS raised the possibility that the biochemical 
network is entrained by Rhol to establish pulsatility. Importantly, this 
predicts that Rhol GTP pulses are independent of the activities of 
downstream effectors. However, inhibiting Myoll by inhibition of 
Rok (10 mM Y27632 or 10mM H1152) completely blocked the cor- 
tical recruitment of AniRBD-GFP, kinase-dead Rok-GFP, wild-type 
Rok-GFP and MBS-GEFP, and the pulsatility of cortical F-actin 
(Supplementary Video 5, Fig. 3c-j and Extended Data Fig. 4d). 
These observations are inconsistent with an upstream pace-making 
activity of Rhol and suggest that pulsatility is instead an emergent 
property of the whole network. 

Exploring the initiation of pulses, we found that early GBE (stage 5a) 
is associated with fluctuations of Myoll, Rok and AniRBD (Extended 
Data Figs 5e and 6a, b). Over time (stage 7), the fluctuations assemble 
to form pulses with a nonlinear rate of change (Extended Data Fig. 6c). 
Lack of a detectable time delay between Myoll and AniRBD fluctua- 
tions (Extended Data Fig. 6d) and non-predictable pulse position 
(Extended Data Fig. 6e) suggests that pulsatility involves amplification 
of Rhol-mediated stochastic activation. To understand the nature of 
this amplification, we focused on the cortical dynamics of the network 
components. During pulse formation, contractile MyolI speckles 
move in the plane of the cortex together with F-actin’, a process here- 
after referred to as advection. Advection and concentration of Myoll, 
potentially with molecules that activate Myoll, could create a positive 
feedback that amplifies recruitment of the network components 
(Fig. 4p). We first measured advection using particle image velocime- 
try (PIV)”’ (Supplementary Video 6 and Fig. 4a). The velocity vectors 
from AniRBD, Rok, MBS and F-actin strongly correlated with vectors 
of Myoll (Fig. 4a, b), showing that these are advected together. We next 
calculated the divergence of the vector fields. Actomyosin networks, 
Rhol, Rok and MBS converge (negative divergence) in the centre of 
the cell during pulse assembly (Fig. 4c, d). MyolI contractility drives 
advection as inhibition of MyolI activity blocks F-actin convergence 
(Fig. 4g, h). 

To test the requirement of convergent advection for pulse assembly 
we injected a low dose of cytochalasin D (1 mg ml‘) to disrupt actin 
polymerization. Although the effect on F-actin density was mild, MyolI 
convergence was reduced (Supplementary Video 7 and Fig. 4e, g, h). 
Consistent with convergent advection driving signal amplification 
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Figure 3 | Pulsatility of Rhol GTP, Rokand MBS 
requires Myoll activity. a, Average temporal 
cross-correlation of RLC-mCherry with GFP- 
tagged upstream regulators. Pearson’s correlation 
coefficients (r) are shown. b, Box plot of the time 
delays calculated from the cross-correlation 
analysis. NS, not significant. c-f, Time snapshots 
of different embryos injected with Rok inhibitors 
(¥27632/H1152). g-j, Mean medial intensity time 
traces of various embryos injected with Rok 
inhibitor and control. Error bars, s.d.; P values 
calculated with Mann-Whitney U-test. Scale 
bars, 1 im. 
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(by locally concentrating upstream regulators), the amplitudes of 
both Rok and Myoll pulses were reduced (Fig. 4f). We next sought 
to disentangle the contribution of advection and biochemical recruit- 
ment. Pairwise comparison of RLC pulses with Rhol/kinase-dead Rok 
pulses showed that their amplitudes had a linear relation, suggesting 
feedback between advection and biochemical recruitment (Extended 
Data Fig. 7c). A Rok mutant lacking the Rhol-binding domain (kinase- 
dead RokARBD)”* was pulsatile through advection, but exhibited a 
lower pulse amplitude (Extended Data Fig. 7a, b) and lost the linear 
relation with MyolI pulses (Extended Data Fig. 7c). Thus, convergent 
advection promotes concentration and recruitment by biochemical 
activation resulting in positive feedback. Cytosolic proteins (GFP) or 
low-affinity actin-binding proteins (o-catenin) are not pulsatile 
(Extended Data Fig. 7d, e), indicating that pulsatility requires specific 
recruitment to actomyosin networks. 

We next addressed pulse disassembly and the potential role of RLC 
dephosphorylation. Mypt-75d RNAi and intermediate inhibition 
of wild-type RLC in phosphomimetic -RLC-expressing embryos 
(10 mM Y27632) (Supplementary Video 8 and Fig. 4i) increased pulse 
amplitude and duration, and reduced frequency (Extended Data 
Fig. 8a-c), similar to RhoV14 expression (Extended Data Fig. 5f-h). 
Conversely, expression of constitutively active MBS, led to decreased 
amplitude and duration of Myoll pulses (Fig. 4k, Extended Data Fig. 8a— 
c). Interestingly, reduced dephosphorylation also lowered the advection 
velocities of Myoll and Rok (Extended Data Fig. 8d). A complete block 
of phospho-cycling in the phosphomimetic RLC mutant (40mM 
Y27632) led to failure of convergent advection, thus entirely abolishing 
pulsatility (Supplementary Video 8, Fig. 4j, 1 and Extended Data Fig. 8e, 
f). We conclude that RLC dephosphorylation mediates dissociation 
of MyollI during pulse disassembly and sets the regime whereby 
Myoll can be advected with its regulators and thereby be pulsatile. 

However, a constant ratio of Myoll recruitment and advection, and 
dissociation cannot account for Myoll concentration changes in a 
pulse (Extended Data Fig. 9a). Pulse disassembly requires a negative 
feedback signal that could stem either from a gradual decline of advec- 
tion up to a point where dissociation supersedes advection, and/or a 
delayed increase of MyolI dissociation rate” (as observed in Fig. 3b). 
We estimated MyolI turnover during pulsatility by FRAP in embryos 
co-expressing wild-type RLC-mCherry and wild-type RLC-GFP 
(Extended Data Fig. 9b, e). We photo-bleached the mCherry channel 
and normalized the recovery time trace to the GFP channel (Extended 
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Figure 4 | Pulsatility is an emergent behaviour of a biomechanical network. 
a, Vector fields of UtrABD and RLC averaged over pulse assembly. Inset 
illustrates the principle of cosine similarity. Dotted box illustrates the 
perimeter of a square for calculating divergence as a function of radius (rad.). 
b, Distribution of cosine similarity indices from average vector fields. P values 
calculated with Kolmogorov-Smirnov test. c, Heat map of average 
divergence values. d, Plot shows average divergence values as a function of 
radius. P values calculated with one-sample t-test. e, Time snapshots of 
embryos co-expressing RLC-mCherry and kinase-dead Rok (RokKD)-GFP 
injected either with DMSO or cytochalasin D. f, Average pulse amplitudes in 
these conditions. P values calculated with Mann-Whitney U-test. g, Average 
divergence values in embryos injected with either cytochalasin D or 


Data Fig. 9c, f). During assembly the ratio of mCherry and GFP chan- 
nels remained constant (though at different values) pre- and post- 
bleaching, indicating low turnover (Extended Data Fig. 9b-d, h). 
During disassembly, however, the ratio recovered to ~50% post- 
bleaching, suggesting a higher turnover (Extended Data Fig. 9e-h). 
In addition, we found that MyolIl advection rates gradually declined 
and reached a minimum when Myoll intensity reached a maximum 
(Fig. 4m). What causes this reduction of advection? As in junctions 
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H1152. h, Plot shows average divergence values as a function of radius in 
various conditions. P values in brackets: one-sample t-test; P values above 
asterisks: Mann-Whitney U-test. i, Time snapshots of wild-type embryos 
injected with Mypt dsRNA and phosphomimetic (EE) embryos injected with 
10mM Y27632. j, Time snapshots of phosphomimetic embryos injected 
with 40 mM Y27632. k, Time snapshots of wild-type embryos expressing 
constitutively active MBS. 1, Representative medial intensity time traces in 
these conditions. m, Mean medial Myoll intensity and the corresponding 
advection velocities. n, o, Average advection velocities of Myoll in different 
conditions as a function of time. P values calculated with Mann-Whitney 
U-test. p, Cartoon model of pulse assembly and disassembly. Scale bars, 1 jum. 


(Extended Data Fig. 3f, g), recruitment and densification of medial 
actomyosin networks could increase local friction or tension, thereby 
reducing advection. Indeed, the advection velocity was reduced when 
medial MyolI concentration was high (Extended Data Fig. 8b and 
Fig. 4n), or when F-actin networks were made denser by the expression 
of a constitutively active form of a Rhol-activated actin nucleator 
termed ‘diaphanous’ (lacking the domains responsible for intramole- 
cular inhibition)’ (Supplementary Video 9 and Fig. 4n). Conversely, 
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expression of phospho-dead RLC (T20A/S21A) or cytochalasin D 
injection (1 mg ml ') caused no reduction of advection compared to 
controls (Fig. 40). We conclude that MyolI advection is reduced due to 
densification of actomyosin networks. Notably, MyolI advection 
began to recover during disassembly, though it was associated with 
divergence (Supplementary Video 6, Fig. 4m and Extended Data 
Fig. 9i, j), suggesting mechanical relaxation of the actomyosin net- 
works amplifying Rhol, Rok and Myoll dispersion and depletion. 

Under the regulation of Rhol activity, actomyosin networks exhibit 
three classes of behaviours (Extended Data Fig. 10). When Rhol activ- 
ity is high, MyolI stably binds F-actin and increases network tension 
(vertical junctions or when RhoV14/phosphomimetic RLC is 
expressed or phosphatase is inhibited). When Rhol activity is low, 
Myoll dissociation is high, resulting in lower network tension (trans- 
verse junctions or when constitutively active MBS/phospho-dead RLC 
is expressed). Between these two behaviours, a pulsatile regime 
emerges as a result of feedback between advection and dissociation 
rates: advection-mediated positive feedback amplifies Rhol, Rok 
and Myoll recruitment in a cascade; but actomyosin concentration 
negatively feeds back to reduce advection. Such self-organization of 
biomechanical networks could operate in other systems where acto- 
myosin flows have been observed, such as apical cell constriction, cell 
migration and cytokinesis’. Spatial control over MyolI turnover thus 
provides a potentially general mechanism to drive polarized shape 
changes during tissue morphogenesis. How polarity of these mech- 
anical ratchets is achieved in different tissues still remains unclear. We 
propose that the intrinsic state of actomyosin networks and their 
regulators is a self-organized biomechanical process, which is tuned 
by Rhol activity, regulated in a tissue-dependent manner and fun- 
damental to morphogenesis. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 


Received 18 September 2014; accepted 29 May 2015. 
Published online 27 July 2015. 


1. Munro, E., Nance, J. & Priess, J. R. Cortical flows powered by asymmetrical 
contraction transport PAR proteins to establish and maintain anterior-posterior 
polarity in the early C. elegans embryo. Dev. Cell 7, 413-424 (2004). 

2. Kim, H. Y. & Davidson, L. A. Punctuated actin contractions during convergent 
extension and their permissive regulation by the non-canonical Wnt-signaling 
pathway. J. Cell Sci. 124, 635-646 (2011). 

3. Martin, A. C., Kaschube, M. & Wieschaus, E. F. Pulsed contractions of an actin- 
myosin network drive apical constriction. Nature 457, 495-499 (2009). 

4. Solon, J., Kaya-Copur, A. Colombelli, J. & Brunner, D. Pulsed forces timed by a 
ratchet-like mechanism drive directed tissue movement during dorsal closure. 
Cell 137, 1331-1342 (2009). 

5. He,L, Wang, X., Tang, H. L. & Montell, D. J. Tissue elongation requires oscillating 
contractions of a basal actomyosin network. Nature Cell Biol. 12, 1133-1142 
(2010). 

6. Rauzi, M., Lenne, P. F. & Lecuit, T. Planar polarized actomyosin contractile flows 
control epithelial junction remodelling. Nature 468, 1110-1114 (2010). 

7. Kaibuchi, K., Kuroda, S. & Amano, M. Regulation of the cytoskeleton and cell 
adhesion by the Rho family GTPases in mammalian cells. Annu. Rev. Biochem. 68, 
459-486 (1999). 

8. Bertet, C., Sulak, L. & Lecuit, T. Myosin-dependent junction remodelling controls 
planar cell intercalation and axis elongation. Nature 429, 667-671 (2004). 

9. Levayer, R. & Lecuit, T. Biomechanical regulation of contractility: spatial control 
and dynamics. Trends Cell Biol. 22, 61-81 (2012). 


LETTER 


10. Niederman, R. & Pollard, T. D. Human platelet myosin. Il. /n vitro assembly and 
structure of myosin filaments. J. Cell Biol. 67, 72-92 (1975). 

11. Lee, A. & Treisman, J. E. Excessive Myosin activity in mbs mutants causes 
photoreceptor movement out of the Drosophila eye disc epithelium. Mol. Biol. Cell 
15, 3285-3295 (2004). 

12. Kimura, K. et a/. Regulation of myosin phosphatase by Rho and Rho-associated 
kinase (Rho-kinase). Science 273, 245-248 (1996). 

13. Jordan, P. & Karess, R. Myosin light chain-activating phosphorylation sites are 
required for oogenesis in Drosophila. J. Cell Biol. 139, 1805-1819 (1997). 

14. Dawes-Hoang, R. E. et al. folded gastrulation, cell shape change and the control of 
myosin localization. Development 132, 4165-4178 (2005). 

15. Vasquez, C. G., Tworoger, M. & Martin, A.C. Dynamic myosin phosphorylation 
regulates contractile pulses and tissue integrity during epithelial morphogenesis. 
J. Cell Biol. 206, 435-450 (2014). 

16. Kasza, K. E., Farrell, D. L. & Zallen, J. A. Spatiotemporal control of epithelial 
remodeling by regulated myosin phosphorylation. Proc. Nat! Acad. Sci. USA 111, 
11732-11737 (2014). 

17. Vereshchagina, N. et al. The essential role of PP1 beta in Drosophila is to regulate 
nonmuscle myosin. Mol. Biol. Cell 15, 4395-4405 (2004). 

18. Watanabe, T., Hosoya, H. & Yonemura, S. Regulation of myosin II dynamics by 
phosphorylation and dephosphorylation of its light chain in epithelial cells. Mol. 
Biol. Cell 18, 605-616 (2007). 

19. Fernandez-Gonzalez, R., Simoes Sde, M., Roper, J.C., Eaton, S. & Zallen, J. A. Myosin 
|| dynamics are regulated by tension in intercalating cells. Dev. Cell 17, 736-743 
(2009). 

20. Rauzi,M.,Verant,P., Lecuit, T. & Lenne, P. F. Nature and anisotropy of cortical forces 
orienting Drosophila tissue morphogenesis. Nature Cell Biol. 10, 1401-1410 
(2008). 

21. Sen, A., Nagy-Zsver-Vadas, Z. & Krahn, M. P. Drosophila PATJ supports adherens 
junction stability by modulating Myosin light chain activity. J. Cel/ Biol. 199, 
685-698 (2012). 

22. Bardet, P. L. et al. PTEN controls junction lengthening and stability during cell 
rearrangement in epithelial tissue. Dev. Cel/ 25, 534-546 (2013). 

23. Mason, F. M., Tworoger, M. & Martin, A. C. Apical domain polarization localizes 
actin-myosin activity to drive ratchet-like apical constriction. Nature Cell Biol. 15, 
926-936 (2013). 

24. Simdes Sde, M. et al. Rho-kinase directs Bazooka/Par-3 planar polarity during 
Drosophila axis elongation. Dev. Cel! 19, 377-388 (2010). 

25. Nemoto, Y., Namba, T., Kozaki, S. & Narumiya, S. Clostridium botulinum C3 ADP- 
ribosyltransferase gene. Cloning, sequencing, and expression of a functional 
protein in Escherichia coli. J. Biol. Chem. 266, 19312-19319 (1991). 

26. Piekny, A. J. & Glotzer, M. Anillin is a scaffold protein that links RhoA, actin, and 
myosin during cytokinesis. Curr. Biol. 18, 30-36 (2008). 

27. Levayer, R. & Lecuit, T. Oscillation and Polarity of E-Cadherin Asymmetries Control 
Actomyosin Flow Patterns during Morphogenesis. Dev. Cell 26, 162-175 (2013). 

28. Simédes Sde, M., Mainieri, A. & Zallen, J. A. Rho GTPase and Shroom direct planar 
polarized actomyosin contractility during convergent extension. J. Cell Biol. 204, 
575-589 (2014). 

29. Lewis, J. Autoinhibition with transcriptional delay: a simple mechanism for the 
zebrafish somitogenesis oscillator. Curr. Biol. 13, 1398-1408 (2003). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We thank M. Rao for useful discussions, and the Lecuit and Lenne 
groups and the Labex INFORM (ANR-11-LABX-0054) for support and discussions. We 
are grateful to M. Krahn, A. Martin, V. Mirouse, J. Treisman and J. Zallen for the gift of 
flies. This work was supported by the ANR Archiplast and the ERC (Biomecamorph 
#323027) and HFSP (T.L. and E.M.). A.M. was supported by Ministére de |’Education 
nationale and the Association pour la Recherche contre le Cancer (ARC). We 
acknowledge the France-Biolmaging/PICsL infrastructure (ANR-10-INSB-04-01),. 


Author Contributions This project originated from discussions between T.L. and E.M., 
based on a computational model developed by E.M. A.M. and T.L. planned the project, 
discussed the data with input from E.M., and wrote the paper. A.M. did all the 

experiments and analysis. J.-M.P. made the fly constructs and helped with molecular 
biology. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 

Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to T.L. (thomas.lecuit@univ-amu.fr). 


20 AUGUST 2015 | VOL 524 | NATURE | 355 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


METHODS 

Transgenic Lines. MyolI RLC is encoded by the gene spaghetti-squash (sqh, 
Genebank ID: AY122159)°°. For generating GFP-tagged phosphomimetic Sqh 
(phosphomimetic RLC-GFP) and phospho-dead Sqh (phospho-dead RLC- 
GEP), sqh-mCherry plasmid under its native promoter (kind gift from 
A. Martin)’ was modified by replacing mCherry (ATG-+TAA) to monomeric 
eGFP (A206—K). Corresponding codons of the amino acids Thr20 and Ser21 
on the sgh gene were modified to Glu20 and Glu21 (phosphomimetic) or Ala20 
and Ala21 (phospho-dead) using the Quick-Change Mutagenesis Kit (Stratagene). 
Recombinant expression vectors were sent to BestGene Inc. for random insertion 
transgenesis. 

To construct the Rhol GTP sensor (AniRBD-GFP), monomeric eGFP was 
fused to the C-terminal end of Drosophila anillin (amino acids 748-1239, 
Genebank ID: AAL39665) that lacks the N-terminal myosin- and actin-binding 
domain but retains its Rhol-binding domain. A GSSGSS linker was added in 
between GFP and anillin. This sensor is similar to the one used in human cells”. 
The transgene is expressed under the ubiquitin p63E promoter. For cloning, Ubi- 
GFP::AniRBD ORF replaced the stop-CD8GFP in the Ubi-stop-mCD8GFP plasmid 
obtained from Stepan Belyakin, Russia (GeneBank ID: KC845568). Recombinant 
expression vector was sent to BestGene Inc. for PhiC31 site-specific mediated 
insertion transgenesis. 

All FASTA sequences of the complete vectors are available on request. 

Fly constructs and genetics. Fly (Drosophila melanogaster) constructs and gen- 
etics are listed below, where asterisks denote UAS (upstream activating 
sequence) males crossed with maternal transcription 67gal4-expressing females 
(mattubgal4VP16). 

Wild-type RLC-GEP distribution and dynamics in various conditions. (1) yw cv 
sqhAX3;sqh-sqh::GFP;+ (a gift from R. Karess) for water, Mypt dsRNA, cytocha- 
lasin D and DMSO injections and PIV in Figs 1a-c, 2b, 4i, ], m-o, Extended Data 
Figs Ic, 2e, 3, 6e, 8a-c, e, f, 9i, j and Supplementary Videos 6 and 8. (2) F2 progeny*: 
male UAS-MBS shRNA crossed with female +;67Gal4,sqh-sqh::GFP;+ (UAS- 
MBSshRNA is a Bloomington Stock 41625) in Extended Data Fig. 1d, e. (3) F2 
progeny*: male UAS-CA-MBS crossed with female +;67Gal4,sqh-sqh::GFP;+ 
(UAS-CA-MBS/MBSN300 was a gift from J. Treisman) in Fig. 4k and Extended 
Data Figs 2c-e and 8a-c. (4) Fl progeny*: male UAS-RhoV14 crossed with female 
+;67Gal4,sqh-sqh::GFP;+ (UAS-RhoV14 is a Bloomington Stock 7330) in 
Extended Data Fig. 5d, e and Supplementary Video 3. (5) Fl progeny*: male 
UAS-diaCA NC crossed with female +;67Gal4,sqh-sqh::GFP;+ (UAS-diaCA NC 
was a gift from P. Rorth) in Figure 4n and Supplementary Video 9. 
Phosphomimetic RLC (EE-RLC)-GFP-expressing embryos. Phosphomimetic- 
RLC-expressing embryos: sqh-sqhEE::GFP;+;+ (see transgenic lines) compared 
to wild-type embryos: +;sqh-sqh::GFP;+ (a gift from R. Karess) in Fig. 1b, 4i,j, 1, n, 
Extended Data Figs 2e, 3d-g, 8a-c, e, f and Supplementary Video 8. 

Rok inhibition in embryos co-expressing phosphomimetic RLC-GEP and wild-type 
RLC-mCherry. + ;sqh-sqh::GEP, sqh-sqh::Cherry;+ compared with sqh-sqhEE::GFP; 
sqh-sqh::Cherry;+ (sqh-sqh::Cherry was a gift from A. Martin) in Extended Data 
Fig. 2a, b. 

FRAP of phosphomimetic RLC-GEP- and wild-type RLC-GFP-expressing embryos. 
Four copies: +;sqh-sqh::GFP;+ compared to sqh-sqhEE::GFP;+;+ in Extended 
Data Fig. 2e. Two copies: yw cv sqhAX3;sqh-sqh::GFP;+ compared to sqh- 
sqhEE::GFP;+;+ injected with 40 mM Y27632 in Extended Data Fig. 2e. 
UtrABD::GFP dynamics. yw cv sqhAX3;sqh-UtrophinABD::GEP/Cyo;sqh-sqh:: 
mCherry in Figs 2a, 3f, j and 4a-c, g, h. Fl progeny*: male UAS-diaCA NC crossed 
with female +;67Gal4/sqh-UtrophinABD::GEP;sqh-sqh::mCherry/+ in Supple- 
mentary Video 9. 

MBS-GFP distribution and dynamics. F2 progeny*: male UAS-MBS::GFP crossed 
with female yw cv sqhAX3;67Gal4,sqh-sqh::Cherry;+ (UAS-MBS::GFP was a gift 
from M. Krahn) in Figs 1d, e, 2e-i, 3a, b, e, i, 4b, d, Extended Data Fig. 4a and 
Supplementary Videos 2 and 5. 

RokKD-GEP distribution and dynamics. (1) +;sqh-sqh::mCherry;sqh-RokKD::GFP 
(sqgh-RokKD::GFP was a gift from J. Zallen) in Figs le, 2c, d, g-i, 3a, b, d, h, 4b, d-f, 
Extended Data Figs 6a, c, 7c, 8d and Supplementary Videos 2 and 5. (2) Fl 
progeny*: male UAS-RhoV14; crossed with female 67Gal4,sqh-RokKD::GFP/+; 
sqh-sqh::Cherry/+ (sqh-RokKD::GFP was a gift from J. Zallen) in Extended Data 
Fig. 5f-h. 

Wild-type Rok-GFP distribution and dynamics. +;sqh-sqh::Cherry; Ubi-Rok::GEP/ 
Tm6 (Ubi-Rok::GFP was a gift from V. Mirouse) in Extended Data Fig. 4b-d. 
Rhol GTP sensor distribution and dynamics. +;Ubi-AniRBD::GFP,sqh-sqh:: 
Cherry;+ (see transgenic lines) in Figs 2g-l, 3a-c, g, 4b, d, Extended Data Figs 
5a-c, 6b-d, 7c and Supplementary Videos 4 and 5. 

Cytosolic GFP dynamics. F1 progeny*: male UAS-GFP crossed with female yw cv 
sqhAX3;67Gal4, sqh-sqh::Cherry;+ (UAS-GEFP is a Bloomington Stock 30140) in 
Extended Data Fig. 7d, e. 


a-Catenin-YFP dynamics. +;+;Endo-a-Catenin::Y FP,sqh-sqh::mCherry (a-Catenin:: 
YFP is a Kyoto Stock) in Extended Data Fig. 7d, e. 

RokKDARBD- Venus and control dynamics. F2 progeny*: male UAS-RokK116A 
crossed with female yw cv sqhAX3;67Gal4,sqh-sqh::Cherry;+; and male UAS- 
RokK116AARBD crossed with yw cv sqhAX3;67Gal4,sqh-sqh::Cherry;+ (gifts from 
J. Zallen) in Extended Data Fig. 7a-c. 

Phospho-dead RLC (AA-RLC)-GFP-expressing embryos. +;+;sqh-sqhAA::GEP 
compared with: +;sqh-sqh::GFP;+ (see transgenic lines) in Fig. 40. 

Two-colour FRAP. +;sqh-sqh::GFP,sqh-sqh::Cherry;+ in Extended Data Fig. 9b-h. 
RNA interference. A double-stranded RNA probe against Mypt-75d was made 
using PCR product containing the sequence of the T7 promoter (TAATACG 
ACTCACTATAGGG) followed by 21 nucleotides specific to the gene. 
Sequences were amplified between nucleotides 3,133-3,517 (GeneBank ID: 
AY075426). PCR products were subsequently used as a template for the in vitro 
RNA synthesis. 0-1-h-old embryos were injected using 5 uM dsRNA diluted in 
RNase-free water. 

For knockdown of MBS, UAS-driven small hairpin RNA (shRNA) was 
expressed as described in fly constructs and genetics. 

Real-Time PCR. RNAi efficiency was estimated by measuring endogenous 

mRNA levels using quantitative RT-PCR after injection with dsRNA probe 

against Mypt-75d. Total RNA extraction from early gastrulating embryos was 

performed using the miRNeasy micro kit (QIAGEN, no. 217084) with an addi- 

tional DNase treatment step to remove genomic DNA contamination. Reverse 

transcription was performed with the iScript Reverse Transcription Supermix 

(Bio-RAD, no. 170-8841) according to the manufacturer’s protocol. Real-time 

PCR was performed on a CFX96 QPCR detection system (Bio-RAD) with the 

following TaqMan probes (TaqMan Gene Expression Assays from Life 

Technologies) plus TaqMan Gene expression master mix, following classical 

TaqMan protocol: 

¢ MYPT-75D: Dm01806472_g1 = probe 1 (exon 7-8 boundary, 1983/ 
GenBank AY075426, amplicon = 142 bp) and Dm01806468_g1 = probe 2 
(exon 3-4 boundary, 948/GenBank AY075426, amplicon = 98 bp). 

e House-keeping gene referent RPL32: Dm02151827_g1 (exon 2-3 boundary, 
948/GenBank AY070656, amplicon = 72 bp) 

¢ House-keeping gene referent ACT42A: Dm02362162_s1 (exon 2, 948/ 
GenBank AY118907, amplicon = 108 bp) 

e House-keeping gene referent RPII140: Dm02134593_g1 (exon 2-3 boundary, 
2331/GenBank BT003265, amplicon = 78 bp) 

Quantitative RT-PCR conditions were as follows: 40 cycles of 95 °C for 15s and 
60°C for 60s. Analyses were performed in triplicate from three independent 
experiments. After comparison of three house-keeping genes, transcript levels 
were first normalized to one, Rpl-32; and then to their respective control group. 
AAC, method was used to estimate relative amounts using the Bio-RAD CFX 
manager software. 

A threefold depletion of Mypt-75d transcripts was observed in Mypt-75d RNAi 

embryos compared to control embryos with both Taqman probes. P values were 
calculated with unpaired t-tests using qBasePLUS software (Biogazelle). 
Drug injections. Y27632 dihydrochloride (Sigma) was injected in stage 5a 
embryos expressing phosphomimetic RLC at 40 mM for strong inhibition, and 
at 10 mM for weaker inhibition of endogenous RLC or wild-type RLC-mCherry. 
For wild-type embryos, inhibition was strong at 10 mM concentration. Inhibition 
of Myoll by Y27632 (10 mM) was comparable to another Rok inhibitor, namely 
H-1152 (Enzo Life Science) at 10 mM. 

C3 exoenzyme (Enzo Life Sciences) was injected into stage 5a embryos at 
2pgul'. 

Cytochalasin D (Sigma) suspended in DMSO was injected into stage 5a 

embryos at 1 mgml~' concentration diluted from a 10mg ml7' stock. Corres- 
ponding dilution of DMSO was made for control injections. 
Image acquisition. Embryos were prepared as described before*’. Time-lapse 
imaging was done from stage stage 5a for observing medial fluctuations during 
early GBE and from stage 7 to stage 8 for all other experiments (5-30 min depend- 
ing on the experiment). A Nikon spinning disc Eclipse Ti inverted microscope 
using a 100X, 1.4N.A oil-immersion objective at room temperature (~22 °C) was 
used. The system acquires images using the MetaMorph software. For junctional 
and medial intensity measurements, starting from the most apical plane, seven 
z-sections, 0.5 jim apart, were acquired every 4 or 5 s using a single camera. For 
performing temporal cross-correlation analysis on medial intensity time traces, 
three apical z-sections 0.75 1m apart were acquired every 2 s using a single camera. 
For visualizing UtrABD-GFP, only three apical z-sections 0.5 um apart were 
acquired. Laser power was measured and kept constant between experiments. 

FRAP was performed using the targeted laser of the Nikon spinning disc 
(iLAS?). A constant junctional area was photo-bleached (0.91 um diameter) for 
every experiment. 
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Laser ablation. Nano-ablation of junctions was performed on an inverted micro- 
scope (Eclipse TE 2000-E, Nikon) with a spinning disk (Ultraview ERS, Perkin 
Elmer) using a 100X, 1.4 N.A. oil-immersion objective. A home-built system was 
used as described previously”, using a focused laser beam exposed for 5-10 ms 
with a power between 800-950 mW depending on the experiment. Images were 
acquired every 200 ms to get a better temporal resolution. 

Image analysis. All image processing was done in Fiji freeware software. For all 
quantifications, sum-intensity z-projection of slices was used, followed by a back- 
ground subtraction using the available plugin in Fiji (‘subtract background’, ball 
radius 30 pixels, 1 pixel = 0.13 jum). 

For junctional intensity and planar polarity measurements, regions were drawn 
manually (width 6 pixels). Junctions were categorized based on angles (0-15° 
corresponding to transverse and 75-90° corresponding to vertical as shown in 
Fig. 1c). For plotting planar polarity, intensity from each vertical junction was 
normalized to the mean of transverse junctions in a given embryo. These values 
were averaged per embryo. The means of the averaged values were taken to plot 
Fig. 1b, Extended Data Fig. le and Extended Data Fig. 4a. For comparing junc- 
tional intensity values in Extended Fig. 2, intensity from each junction was nor- 
malized to the mean of the junctions of the respective control embryos. The means 
of the normalized values were plotted in all graphs of Extended Data Fig. 2. 

For pulsatility measurements, individual cells were tracked using CellTrack1.1 
(open source software link: http://bio.cse.ohio-state.edu/CellTrack/). The software 
tracks cells using vertices as points of reference. The procured region was shrunken 
using a constant thickness (6 pixels) to exclude contribution from junctional 
signals. Medial intensity time traces were smoothed to automatically calculate 
pulse amplitude and duration using Igor Pro (Wavemetrics). To calculate normal- 
ized pulse amplitude and per cent recruitment of MyolI and upstream regulators 
(Fig. 2g, h), mean intensity and total intensity time traces were normalized to their 
minimum values respectively (base value of pulses) and treated as described above 
to extract amplitudes. Values plotted in Fig. 2g are amplitudes from normalized 
mean intensity time traces. Total intensity pulse amplitudes were subtracted from 
the respective mean intensity pulse amplitudes to calculate the per cent contri- 
bution of densification as a result of apical area reduction (as depicted in Fig. 2b). 
Pulse amplitudes reported in all other figures are calculated from mean intensity 
time traces. 

The first derivative of increase in intensity during pulse assembly was calculated 
in Igor using normalized and smoothed mean intensity time traces (Extended 
Data Fig. 6c). 

To measure the location of pulses, the Fiji function ‘centre of mass’ was used to 
get the coordinates of pulse coalescence in a region of interest with constant size 
(60 by 60 pixels, Extended Data Fig. 6e). 

FRAP quantification. Bleached region was tracked using a home-made Fiji 
macro. Measurements were done only on those intensity time traces that reached 
a plateau. Intensity at each time point was normalized as follows to scale the 
intensity time trace between 0 and 1 
I, 7x Tin _ 
Tnax = Emin . 
Where: Imax» pre-bleach intensity; [,,in, bleached intensity; I,, intensity at time f; 
and y, normalized intensity. 

The normalized recovery trace was fit to a single exponential in Origin 

(version 8) to obtain the mobile fraction as follows 


y=A(l—e “) 


Where: A, mobile fraction (end value of the recovered intensity); 1, fitted para- 
meter; t, time after bleaching. 

In two-colour FRAP experiments, the ratio of medial Myoll in red and green 
channels was used to do the above normalization (Extended Data Fig. 9b-h). 
Laser ablation quantification. Recoil of the junction after ablation was measured 
by tracking its vertices for 2 s. A linear fit was used to calculate the recoil velocity. 
Temporal cross-correlation analysis. Temporal cross-correlation of medial 
intensity time traces from RokKD, MBS and AniRBD with MyolI RLC were done 
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in Igor Pro, as described previously”, to obtain the Pearson’s correlation coef- 
ficient (r) and time delay. 

PIV analysis. PIV analysis was performed using the MATPIV toolbox in 
MATLAB as described previously”’, and the vectors were averaged over time to 
obtain average vector fields. The analysis was restricted to a central region of the 
cell. The ROI was usually 60 pixels by 60 pixels (1 pixel = 0.13 pm) to catch 
significant medial pool and to avoid perturbation from the junctional signal. 
For PIV of UtrABD-GFP + RLC-mCherry, images were rescaled to get a pixel 
size of 0.16 jum (instead of 0.08 1m). For comparing the direction of the vectors in 
two channels (as depicted in Fig. 4a), the cosine similarity index was calculated 
from vector fields averaged over one pulse cycle (100-120 s) for each channel (red 
(R) or green (G)) as follows 


RG Ri x G; 
|R| | G| 


Cosine similarity 


The resultant value ranges between —1 (opposite angles) to 0 (independent 
angles) to +1 (same angle). As a control, images were taken from sub-apical 
sections (1.5 jum deep) of red (RLC-mCherry) and green (UtrABD-GFP) chan- 
nels to estimate the contribution of junctional drift to the similarity index. 

For calculating divergence, averaging of displacement vectors was done either 
during pulse assembly (duration 0-~60 s) or disassembly (~90-120 s). MATLAB 
‘div’ function was used to calculate divergence from the average vector fields. The 
heat maps in Fig. 4c, g and Extended Data Figs 8e and 9i are average divergence 
values from many cells (101m X 10pm). The divergence in Fig. 4d, h and 
Extended Data Figs 8f and 9j were calculated by averaging the divergence values 
on a perimeter of a square expanding from the centre (0) up to a radius of 5 um 
(as depicted in Fig. 4a). 

For calculating advection velocities during the course of the pulse, PIV was 
performed on a pulse cycle divided into bins of 20s. Integration of displacement 
vectors was done for each bin (20s), the norms of the resulting vectors were 
averaged in a given region of interest and were then divided by the total time to 
get a single value for ‘advection velocity’ per bin. The advection velocities in 
Fig. 4m are normalized to the minimum value which coincides with the maximum 
value of the normalized medial Myoll intensity. 

Statistics. Average values are calculated from ‘n’, where n is the number of junc- 
tions/cells/pulses as noted in the figure. This number was used for conducting 
statistical significance tests. In planar polarity measurements (Fig. 1b, Extended 
Data Figs le and 4a), n is the number of embryos. Error bars usually depict s.e.m. 


(that is, a where m= number of embryos, unless otherwise specified). For 
box plots, the open square indicates the mean, crosses indicate 99% or 1%, dashes 
are maximum or minimum and error bars are s.d. For large data sets (n > 100) 
such as junctional intensity measurements, two-sample Student’s t-tests (two- 
tailed distribution with unequal variance) were used to determine P values. A t-test 
was performed only if the distribution was normal as tested with a Kolmogorov- 
Smirnov test. For small data sets (1< 100) and non-normal distributions, 
non-parametric Mann-Whitney U-tests were used to determine P values. For 
calculating the significance of difference of divergence values from 0, one-sample 
t-tests were used (null hypothesis: mean = 0). For comparing the distribution of 
cosine similarity indices with controls, two-independent-samples Kolmogorov- 
Smirnov tests were used. All statistical tests and box plots were made in OriginPro. 
No experiments were randomized and the investigator was not blinded to any data 
set. No statistical methods were used to predetermine sample size. 

Repeatability. All measurements were made from 1-10 embryos from experi- 
ments replicated 2-10 times. Images and videos are representative of the data set. 


30. Karess, R. E. et al. The regulatory light chain of nonmuscle myosin is encoded by 
spaghetti-squash, a gene required for cytokinesis in Drosophila. Cell 65, 
1177-1189 (1991). 

31. Cavey, M. & Lecuit, T. Imaging cellular and molecular dynamics in live embryos 
using fluorescent proteins. Methods Mol. Biol. 420, 219-238 (2008). 
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Extended Data Figure 1 | RNAi specifically downregulates MyolI 
phosphatase resulting in cellular and tissue-level defects. a, Gene expression 
data of Mypt-75d measured from quantitative PCR in water- and dsRNA- 
injected embryos using two different Taqman probes. P values calculated with 
unpaired t-test. b, Distribution of percentage of GBE in water and Mypt 
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dsRNA- injected embryos. c, Box plot of time taken for junction shrinkage in 
water and Mypt dsRNA-injected embryos. d, Myoll distribution in control 
(left) and MBS shRNA-expressing embryo (right). e, Average planar polarity of 
Myoll in these conditions. P values calculated with Mann-Whitney U-test. 
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Extended Data Figure 2 | RLC phospho-cycling determines quantitative Student’s t-test. c, Distribution of Myoll in control embryos or in embryos 
levels of MyolI at cell junctions. a, Distribution of MyolI in embryos co- expressing constitutively active MBS (CA-MBS). d, Average junctional 


expressing wild-type (WT) RLC-GFP with wild-type RLC-mCherry (top) and _ intensity of Myoll in these conditions. P values calculated with Student'’s t-test. 
phosphomimetic RLC-GFP with wild-type RLC-mCherry (bottom) injected —_ e, Average intensities of MyolI on transverse junctions in various conditions. 
with water (left) or Y27632 (right). b, Mean intensity of Myollin mCherry (top) P values calculated with Mann-Whitney U-test. Scale bar, 5 jum. 

and GFP (bottom) channels in these conditions. P values calculated with 
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Extended Data Figure 3 | Proportions and cycling of phosphorylated RLC 
regulates MyolI turnover and mechanical tension. a, b, Mean fluorescence 
recovery time traces (normalized as described in methods) of RLC-GFP 
from vertical (red) and transverse junctions (blue) in wild-type (a) and 
Mypt-75d RNAi (b) embryos. The dotted lines are fits derived from the 
equation described in the methods. c, Table shows the goodness of fit (R?) and 
the respective mobile fraction from fitted curves in a and b. d, Plot shows 

the mobile fraction of RLC in different conditions. Values from each junction 


(open circles) and the average from all junctions (open rectangle) are shown. 
e, Cartoon showing scission of junctions and their elastic recoil in response 

f, Vertical and transverse junctions before (0 s) and after (5 s) laser ablation in 
different conditions. g, Recoil velocities of vertical and transverse junctions 
in different conditions. Values from each junction (open circles) and the 
average from all junctions (open rectangle) are shown. P values calculated 
with Mann-Whitney U-test. Scale bar, 1 um. 
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Extended Data Figure 5 | Rhol GTP is required for MyolI activation. RLC-GFP in control and embryo expressing constitutively active Rho 
a, Representative images of AniRBD-GFP (left) and RLC-mCherry (right) in (RhoV14). Scale bar, 1 um. e, Mean medial intensity time traces of RLC-GFP in 
water (top) and C3 exoenzyme (bottom) injected embryos. Scale bar, 5 um. different conditions and stages (Error bars: Standard deviation) f, Average 


b, AniRBD-GFP and RLC-mCherry mean junctional intensities in water and _ pulse amplitudes of RLC-mCherry and kinase-dead Rok (RokKD)-GFP in 
C3 exoenzyme injected embryos. P values calculated by Student’s t-test.c, Mean _ control and RhoV14-expressing embryos g, h, Average duration and frequency 
medial intensity time trace of AniRBD-GFP (left) and RLC-mCherry (right) in of Myoll pulses in control and RhoV14-expressing embryos. P values 

C3 exoenzyme injected embryos compared to a representative water injected calculated by Mann-Whitney U test. 

embryo. Error bars represent s.d. d, Time snapshots showing medial pool of 
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RokKD-Venus (top) or ROKKDARBD-Venus (bottom). b, Mean pulse a-catenin-YFP (bottom). e, Mean medial intensity time traces of RLC- 
amplitudes of RokKD-Venus and RokKDARBD-Venus. ¢, Pairwise mCherry, cytosolic GFP and o-catenin-YFP during GBE. Scale bar, 1 pm. 
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Extended Data Figure 8 | RLC dephosphorylation is required for pulse U-test. e, Average divergence values from many cells of embryos expressing 
disassembly. a—c, Mean Myoll pulse duration (a), amplitude (b) and wild-type RLC-GFP (left), phosphomimetic (EE) RLC-GFP (middle) and 
frequency (c) in various conditions. P values calculated with Mann-Whitney —phosphomimetic RLC-GFP injected with Y27632 (40 mM) (right). f, Plot 
U-test. d, Average advection velocity of medial MyolI and RokKD in wild-type showing average divergence values as a function of cell radius in these 

and mypt dsRNA-injected embryos. P values calculated with Mann-Whitney conditions. P values calculated with one-sample t-test. 
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Extended Data Figure 9 | Non-constant turnover and advection results in 
Myoll depletion during pulse disassembly. a, Behaviour of Myoll (green) for 
different ratios of recruitment + advection/dissociation (black) (dashed line, 
ratio >1; dotted line, ratio = 1; dot—dash line, ratio <1; solid line: non-constant 
ratio starting from >1 and crossing over to <1). b, e, Medial intensity time 
traces of embryo co-expressing RLC-mCherry and RLC-GFP. The star shows 
the time of photo-bleach either at the onset of pulse assembly (b) or pulse 
disassembly (e). ¢, f, Time traces show the ratio of intensities from the red and 
the green channels (black). d, g, Fluorescence recovery curves obtained from 


LETTER 


the red/green ratio. The curves are normalized as described in the methods. 
h, Average of fluorescence recovery curves from many different experiments. 
Error bars, s.e.m. where nm = number of pulses. The dotted curve corresponds to 
the fit derived from the equation described in the methods. Table shows the 
goodness of fit (R) and the respective mobile fraction during pulse assembly 
and disassembly. i, Average divergence values of Myoll during pulse 
assembly (left) and disassembly (right). j, Plot shows average divergence values 
as a function of cell radius during pulse assembly and disassembly. P values 
calculated with one-sample t-test. 
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Extended Data Figure 10 | A qualitative phase diagram of MyolI dynamics. The phase diagram summarizes Myoll behaviour (stable or oscillatory) in the 
parameter space of Myoll dissociation and Myoll activity in different experimental conditions. 
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The zinc transporter ZIP12 regulates the pulmonary 
vascular response to chronic hypoxia 
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The typical response of the adult mammalian pulmonary circulation 
to a low oxygen environment is vasoconstriction and structural 
remodelling of pulmonary arterioles, leading to chronic elevation 
of pulmonary artery pressure (pulmonary hypertension) and right 
ventricular hypertrophy. Some mammals, however, exhibit genetic 
resistance to hypoxia-induced pulmonary hypertension’*. We used 
a congenic breeding program and comparative genomics to exploit 
this variation in the rat and identified the gene SIc39a12 as a 
major regulator of hypoxia-induced pulmonary vascular remod- 
elling. Slc39a12 encodes the zinc transporter ZIP12. Here we 
report that ZIP12 expression is increased in many cell types, 
including endothelial, smooth muscle and interstitial cells, in 
the remodelled pulmonary arterioles of rats, cows and humans 
susceptible to hypoxia-induced pulmonary hypertension. We show 
that ZIP12 expression in pulmonary vascular smooth muscle cells 
is hypoxia dependent and that targeted inhibition of ZIP12 inhi- 
bits the rise in intracellular labile zinc in hypoxia-exposed pulmo- 
nary vascular smooth muscle cells and their proliferation in 
culture. We demonstrate that genetic disruption of ZIP12 express- 
ion attenuates the development of pulmonary hypertension in rats 
housed in a hypoxic atmosphere. This new and unexpected insight 
into the fundamental role of a zinc transporter in mammalian 
pulmonary vascular homeostasis suggests a new drug target for 
the pharmacological management of pulmonary hypertension. 
We have reported previously that the Fisher 344 (F344) rat strain is 
resistant to hypoxia-induced pulmonary hypertension compared with 
the Wistar Kyoto (WKY) strain’. Linkage analysis of an F, population 
derived from inbred WKY X F344 rats identified a quantitative trait 
locus (QTL) on chromosome 17 (ref. 2). On the basis of this obser- 
vation, we next conducted ten successive microsatellite-guided back- 
crosses of offspring with WKY rats and derived two congenic strains in 
which the original QTL was dissected and represented as partly over- 
lapping regions of a donor F344 genome interposed onto the genetic 
background of the WKY recipient strain (Extended Data Figs 1 and 2). 
Resistance to hypoxia-induced pulmonary hypertension was detected 
in one of the congenic strains (R47A; Fig. la-d and Extended Data 
Fig. 2). Three subcongenic strains (SubA, SubB and SubC) were 
derived by further backcrosses of R47A onto the WKY background 
and the congenic interval was fine-mapped to a region of 8.28 mega- 
base pairs (Mbp) containing an estimated 65 genes (rat chromosome 17: 
positions 85,072,475-93,347,784) (Fig. 1 and Extended Data Fig. 2a). 
Whole-genome sequencing (greater than 20x coverage) of the WKY 
and F344 parental strains* revealed 13 non-synonymous coding SNPs 
affecting 9 genes within the refined congenic interval, and 6 indels 


resulting in frameshift mutations in 4 genes (Extended Data Table 1). 
Polymorphic examination of the 13 SNPs and 6 indels in 2 additional 
rat strains susceptible to hypoxia-induced pulmonary hypertension 
(the spontaneously hypertensive and fawn-hooded rat strains, 
respectively) excluded 5 SNPs and 5 indels and narrowed the genes 
of interest to 7 (Slc39a12, St8sia6, Cubn, Nmt2, Dclrelc, Hspal4 and 
Cdnf; Fig. le and Extended Data Table 1). Further polyphen analysis 
allowed us to exclude five listed genes (St8sia6, Cubn, Nmt2, Delre1c 
and Cdnf) as the non-synonymous coding changes were predicted to 
be benign. We identified Slc39a12, with a loss of thymidine at position 
88,575,534 leading to a frameshift mutation in exon 11, as the highest 
priority candidate gene for further investigation. 

Slc39a12 encodes the solute carrier 39 zinc transporter family 
(ZIP1-14) member 12 (ZIP12) and has high specificity for zinc’. 
The ZIP family tightly regulates cellular zinc homeostasis in numerous 
cell types by promoting zinc uptake from the extracellular space or 
release from intracellular compartments. The rat Slc39a12 gene con- 
tains 12 exons and the ZIP12 protein comprises 688 amino acids with a 
secondary structure comprising 8 transmembrane domains. In the 
F344 strain, the frameshift mutation in Slc39a12 introduces a stop- 
codon predicting a carboxy (C)-terminal truncated ZIP12 protein of 
553 amino acids (Extended Data Fig. 3a). This affects the conserved 
zinc transporting aqueous cavity between transmembrane domains IV 
and V, resulting in the loss of the metalloprotease motif (HEXPHE), 
which would be expected to lead to a reduction in zinc transport®. 

A pathognomonic histological signature of chronic hypoxia-induced 
pulmonary hypertension is thickening of the pulmonary vascular 
media (owing to hyperplasia and hypertrophy of smooth muscle cells) 
and the muscularization of previously unmuscularized pulmonary 
arterioles’. We found that ZIP12 messenger RNA (mRNA) levels were 
very low and ZIP12 protein was undetectable by immunohistochem- 
istry in the pulmonary vasculature of adult WKY rats housed in 
a normal oxygen atmosphere, but WKY rats exposed to hypoxia 
showed markedly increased lung ZIP12 mRNA levels and pronounced 
ZIP 12 expression in remodelled pulmonary arterioles (Fig. 2a, b and 
Extended Data Fig. 3b). ZIP12 expression was evident in vascular 
smooth muscle but also other cell types (endothelial and interstitial 
cells) known to contribute to structural changes seen in hypoxic lungs. 
In contrast, and consistent with a frameshift mutation in Slc39a12 
predicting a C-terminal truncated protein, ZIP12 was undetectable 
with an antibody directed at the C terminus of the protein in the lungs 
of chronically hypoxic F344 rats (Fig. 2b and Extended Data Fig. 3b). 

Slc39a12 is highly conserved across species* and transcribed 
constitutively in many tissues (http://www.biogps.org). To investigate 
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Figure 1 | The pulmonary vascular response to hypoxia in the F344 rat is 
influenced by a region of chromosome 17 containing Slc39a12. a, A genetic 
map of three sub-congenic strains (SubA, SubB and SubC) derived from the 
R47A congenic strain (originally derived from a WKY X F344 cross) 
backcrossed with the WKY parental strain. The refined congenic region 
(orange) of 8.28 Mbp containing 65 genes is within the SubB strain. b-d, SubB 
exhibits attenuated pulmonary hypertension after 2 weeks exposure to a 10% O2 
atmosphere compared with WKY, SubA and SubC rats: b, mean pulmonary 
artery pressure (mPAP); ¢, right ventricular hypertrophy (ratio of mass of right 
ventricle to left ventricle plus septum, RV/LV + sep.) (n = 17 WKY, 15 F344, 
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Figure 2 | Slc39a12 encodes a zinc transporter, ZIP 12, which is upregulated 
in pulmonary vascular tissue from mammals exposed to chronic hypoxia. 
a, ZIP12 mRNA levels in control and hypoxic WKY rat lungs (n = 6 per group). 
b, Prominent ZIP12 immunostaining in remodelled pulmonary arterioles in 
WKY (red arrows) but not F344 rat lungs exposed to hypoxia. c, No ZIP12 
staining was detected in pulmonary arteries of low-altitude (normoxia control, 
CO calf) calves and sea-level humans, yet prominent ZIP12 immunostaining 
was observed in the remodelled pulmonary arteries of calves with severe 
pulmonary hypertension (Hx calf), in cattle naturally susceptible to pulmonary 
hypertension at altitude (‘Brisket disease’, BD) as well as Kyrgyz highlanders 
residing above 2,500 m. d, Design of the luciferase reporter vector pGL4.10 
containing a 5’ region of ZIP12 that includes an HRE encoding for both 
HIF-1- and HIF-2«-binding motifs or a mutant HRE sequence where the 


14 R47A, 8 SubA, 10 SubB, 10 SubC); d, vascular muscularization (n = 6 per 
group). Dotted line indicates mean measurements from all the rats in a normal 
oxygen atmosphere (21% O2; mPAP = 14.7 + 0.3 mm Hg; right ventricular 
hypertrophy = 0.270 + 0.004; percentage muscularization = 34.2 + 0.36; for 
actual values in rat strains see Extended Data Fig. 2c—e). Values are expressed as 
mean + s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001 compared with WKY after 
one-way analysis of variance (ANOVA) followed by Bonferroni correction for 
multiple testing. e, The genes of interest (Slc39a12, St8sia6, Cubn, Nmt2, Delre Ic, 
Hspal4 and Cdnf) identified within the SubB congenic interval. The frameshift 
mutation in Slc39a12 introduces a stop-codon, resulting in a truncated protein. 
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5'-ACGTG-3' motif has been replaced by 5'-AGCAG-3’(mHRE). e, Human 
pulmonary artery smooth muscle cells (HPASMCs) transfected with the ZIP12 
HRE reporter vector demonstrated significantly increased luciferase activity 
after exposure to hypoxia, but not in the cells transfected with the mutant 
HRE vector (n = 6 per group, replicated twice ). f, Increased levels of HIF-1a 
and HIF-2« bound to the ZIP12 HRE assayed by ChIP-qPCR of chromatin 
from HPASMCs cultured in normoxia and hypoxic conditions (n = 3 per 
group, replicated twice). Data are calculated as the percentage of input levels, 
with the dotted line marking percentages below mock immunoprecipitation (IP). 
Values are expressed as mean + s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001 
compared with normoxic control after one-way ANOVA followed by 
Bonferroni correction for multiple testing. NS, not significant. 
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Figure 3 | ZIP12 knockdown inhibits hypoxia-induced increase in 
intracellular labile zinc concentration and proliferation of HPASMCs. 

a, Representative wide-field microscope images of HPASMCs transfected with 
eCALWY-4 probe. Hypoxia exposure produced a striking increase in 
intracellular free zinc (resulting in decreased FRET)"°. This was inhibited by 
transfection with ZIP12 siRNA. TPEN-mediated Zn”* chelation was used to 
derive maximum fluorescence and 100 1M ZnCl, in the presence of the Zn?‘ 
ionophore and pyrithione (ZnPyr) was used to derive the minimum 
fluorescence. b, Representative traces showing the changes in fluorescence ratio 
of the eCALWY-4 probe. Steady-state fluorescence intensity ratio of citrine/ 
cerulean (R) was measured, then maximum and minimum ratios were 
determined to calculate free Zn’* concentration using the formula 

[Zn?*] = Ka X (Rmax — R)/(R — Rmin), Where the dissociation constant (Kg) 


the relevance of our observations in rats to other susceptible 
animal species, as well as humans, we examined ZIP12 expression in 
whole lung samples of (1) neonatal calves housed in a normal atmo- 
sphere or exposed to hypobaric hypoxia for 2 weeks (barometric 
pressure = 445 mm Hg (1 mm Hg = 133.3 Pa), equivalent to 4,500 m 
altitude, 12% O.), (2) older (yearling) cattle with naturally occurring 
pulmonary hypertension (so-called “Brisket disease’) developed as a 
result of prolonged pasturing at high altitude (2,438-3,505 m) and (3) 
human subjects at sea level and Kyrgyz highlanders residing above 
2,500 m. ZIP12 expression, which is undetectable by immunohisto- 
chemistry in healthy bovine and human lung exposed to a normal 
oxygen atmosphere (Fig. 2c), was clearly visible in the remodelled 
pulmonary vessels from chronic hypoxia exposure, indicating that 
ZIP12 upregulation in pulmonary vasculature is a common response 
to hypoxia (Fig. 2c). 

To better understand the regulation of ZIP12 by hypoxia, we 
exposed human pulmonary vascular smooth muscle cells in culture 
to hypoxia (2% Oz). Increased hypoxia-inducible factor (HIF) protein 
and ZIP12 gene expression was observed in hypoxic cells; mRNA 
levels of other zinc transporters, ZIP6, ZIP7, ZIP10 and ZnT8, 
were unchanged (Extended Data Fig. 4a, b). Further examination of 
the Slc39a12 gene using HOMER analysis’ revealed a hypoxia response 
element (HRE) encoding both HIF-1o- and HIF-2a-binding motifs 
(Fig. 2d) located 1 kb downstream of the ZIP12 transcription start site 
(human (hg19) chr10: 18,241,879-18,241,887). We cloned a 1.5 kb 
fragment of the 5’ region of ZIP12 containing this HRE into the 
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for eCALWY-4 is 630 pM, the maximum ratio (Rmx) was obtained upon 
intracellular zinc chelation with 50 tM TPEN and the minimum ratio (Rin) was 
obtained upon zinc saturation with 100 1M ZnCl, in the presence of the Zn2* 
ionophore pyrithione (5 1M) (ref. 10). ¢, Quantification of intracellular zinc 
levels (n = 10 each group). d, Chronic hypoxia (48 h) increases ZIP12 mRNA 
levels in HPASMCs, which is inhibited by Slc39a12 siRNA (n = 5 per 

group). e, Representative immunoblot of ZIP12 demonstrating inhibition 

of hypoxia-stimulated ZIP12 protein expression by Slc39a12 siRNA in 
HPASMCs (n = 3). f, ZIP12 siRNA inhibits hypoxia-induced proliferation 

in HPASMCs (n = 5 per group, replicated twice). *P < 0.05, ***P < 0.001 
compared with control group, #P < 0.05 compared with hypoxia group; scr, 
scrambled siRNA control. 


luciferase reporter vector, pGL4.10 (Fig. 2d). Human pulmonary vas- 
cular smooth muscle cells transfected with the ZIP12 HRE reporter 
vector demonstrated significantly increased luciferase activity after 
exposure to hypoxia, while the luciferase activity of cells transfected 
with the mutant HRE vector (a substitution of the 5’-ACGTG-3' motif 
by 5'-AGCAG-3’; Fig. 2d) remained at basal normoxia levels (Fig. 2e). 
Chromatin immunoprecipitation (ChIP) followed by quantitative 
PCR (qPCR) confirmed the enrichment of both HIF-1a and HIF-2% 
bound to this ZIP12 HRE after hypoxia exposure (Fig. 2f). 

We then explored the contribution of ZIP12 to the regulation of 
intracellular zinc levels in human pulmonary vascular smooth muscle 
cells. Intracellular labile zinc measured using a genetically encoded 
fluorescence resonance energy transfer (FRET)-based zinc probe, 
eCALWY-4 (ref. 10), exhibited a striking increase in cells exposed to 
hypoxia for 48h, which was markedly reduced by inhibiting ZIP12 
expression with a targeted short interfering (si)RNA (Fig. 3a-e). 
Inhibition of ZIP12 expression with siRNA also inhibited hypoxia- 
induced pulmonary vascular smooth muscle cell proliferation (Fig. 3f). 
ZIP12 siRNA transfection did not affect intracellular zinc levels or 
proliferation in normoxia (Extended Data Fig. 4c-f). These data sug- 
gest that disrupted ZIP12 expression exerts a direct effect on pulmo- 
nary vascular cells in response to hypoxia and contributes to the 
resistant pulmonary hypertension phenotype exhibited in F344 strain. 

To provide direct genetic confirmation that disrupted ZIP12 
expression attenuates the pulmonary vascular response to hypoxia, 
we employed zinc finger nuclease technology" to introduce mutations 
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in Slc39a12 in the hypoxia-susceptible WKY rat strain. A mutant line 
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in hypoxia (Extended Data Fig. 6h, i). However, given the time- 


was generated containing a frame-shift resulting in a truncated ZIP12 dependent induction of ZIP12 expression in pulmonary vasculature 
protein with loss-of-function (Extended Data Fig. 5). Inter-cross of by hypoxia, the main contribution of ZIP12 is likely to be in regulating 


heterozygous animals generated homozygous (ZIP12-‘~ ), heterozyg- 
ous (ZIP12*’~) and wild-type (WT) rats that were then exposed to 
hypoxia (10% O,) for 2 weeks. ZIP12~'~ rats demonstrated lower 
pulmonary artery pressures, reduced right ventricular hypertrophy 


the response to chronic rather than acute hypoxia. In further support 
of a direct effect on structural remodelling of pulmonary arterioles, we 
investigated angiogenesis ex vivo using pulmonary arteriole rings dis- 
sected from ZIP12-‘~ and WT rats. Vascular outgrowth from 


and less vascular remodelling than WT rats (Fig. 4a-c and Extended ZIP12 ‘~ vessels in response to hypoxia was attenuated compared 


Data Fig. 6a-d) with ZIP12‘’~ rats exhibiting an intermediate pheno- 
type. WT rats resembled WKY rats after exposure to hypoxia, showing 
markedly increased lung ZIP12 expression in the remodelled pulmo- 
nary arterioles, in contrast to the absence of expression in ZIP12~/~ 
rats (Fig. 4d, e). Comparison of the ZIP12~‘~ response to hypoxia 
with the WKY and F344 parental strains revealed that mutation of 
Slc39a12 was responsible for about 50% of the resistance observed in 
the F344 strain, highlighting the importance of Slc39a12 as a hypoxia- 
susceptibility gene but also suggesting that other genes yet to be iden- 
tified might also contribute. 

Systemic blood pressure and cardiac output in the hypoxic 
ZIP12~‘~ rats was similar to that of WT rats (Extended Data 
Fig. 6e-g), signifying that the reduced pulmonary artery pressures in 
the ZIP12-‘~ rat in chronic hypoxia is due to reduced pulmonary 
vascular resistance (PVR; mean pulmonary artery 
pressure = PVR X cardiac output). Both vascular tone and structural 
remodelling contribute to PVR, and increased pulmonary vascular 
tone precedes the structural changes. ZIP12 expression may increase 


with vessels from WT rats (Extended Data Fig. 6), k). 

The underlying mechanisms through which ZIP12 affects hypoxic 
responses remain to be defined. Excess intracellular zinc concentra- 
tions mediated by upregulation of ZIP family members have been 
observed in a variety of tumour tissues and linked to cell proliferation 
and survival'*’*. Zinc is a structural component of a large variety of 
intracellular proteins, including enzymes and transcription factors. 
Zinc-binding motifs have been identified in drug targets for pulmo- 
nary hypertension, for example phosphodiesterase type 5 (PDE5) and 
histone deacetylases'*’’. Reduced ZIP12 expression and intracellular 
labile zinc levels would be expected to inhibit PDE5 activity'®, and we 
have previously shown that PDE5 inhibition attenuated pulmonary 
vascular smooth muscle proliferation in culture”. 

Following on from our demonstration that ZIP12 is hypoxia indu- 
cible and a key regulator of the pulmonary vascular response to 
chronic alveolar hypoxia exposure, we examined lung ZIP 12 express- 
ion in other presentations of pulmonary hypertension where tissue 
hypoxia is an important driver of pathology. Again, in contrast to 


PVR by increasing pulmonary vascular tone. Zinc thiolate signalling healthy lungs, ZIP12 expression was clearly evident in lung tissues 


has been reported to mediate the constriction of pulmonary micro- 
vascular endothelial cells in acute hypoxia through activation of pro- 
tein kinase C and inhibition of myosin light chain phosphatase, 
inducing stress fibre formation and endothelial cell contraction’. 
We have shown that ZIP12-targeted siRNA attenuates stress fibre 


from chronic iron-deficient rats” and rats exposed to monocrotaline, 
as well as in patients with idiopathic pulmonary arterial hypertension 
(IPAH)” (Fig. 4f), prominent in the remodelled pulmonary vascula- 
ture as identified by co-staining with smooth muscle actin (Fig. 4g). 
HIF activation in these tissues was confirmed by upregulation of car- 


formation in human pulmonary vascular smooth muscle cells cultured _ bonic anhydrase IX, a recognized HIF-regulated biochemical signature 
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Figure 4 | Genetic disruption of ZIP12 in WKY rat attenuates hypoxia- 
induced pulmonary hypertension. a-c, Zinc finger nucleases were used to 
disrupt ZIP12 in the WKY strain. Rats deficient in ZIP12 demonstrate allele 
dose-dependent attenuation of hypoxia-induced pulmonary hypertension 
compared with WT rats: a, mPAP; b, right ventricular hypertrophy (normoxia 
groups: n = 10 WT, 8 ZIP12*/-, 12 ZIP12~/"; hypoxia groups: n = 14 WT, 
16 ZIP12*’~, 12 ZIP12~‘~); c, pulmonary arteriole muscularization (n = 5 
per group). ***P < 0.001 compared with normoxia WT group, #P < 0.05 
compared with hypoxia WT group after one-way ANOVA followed by 


ZIP12 ZIP12/SMA/Hoechst 


50 um 


Bonferroni correction for multiple testing. d, ZIP12 was undetectable by 
western blot in hypoxic ZIP12 ‘~ rats but increased in hypoxic WT (WKY) 
rats (n = 3 per group). e, ZIP12 expression by immunohistochemistry of WT 
and ZIP12 ‘~ rat lungs before and after hypoxia (2 weeks). f, ZIP12 expression 
in lungs from a chronic iron-deficient rat, monocrotaline (MCT) rat and a 
patient with IPAH. g, Double immunofluorescence demonstrates co- 
localization of ZIP12 and smooth muscle actin in the remodelled vessels from 
the patient with IPAH. 
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of tissue hypoxia” (Extended Data Fig. 7). Interestingly, the F344 
rat strain has previously been reported to exhibit some resistance to 
monocrotaline-induced pulmonary hypertension”’; this was recapitu- 
lated in the ZIP12~‘~ rat (Extended Data Fig. 8). These data signal a 
fundamental role for ZIP12 in the regulation of pulmonary vascular 
homeostasis in hypoxic stress, relevant to the pathogenesis of pulmo- 
nary hypertension beyond that associated with life in a low-oxygen 
atmosphere. The current treatments for pulmonary hypertension 
centre on the pharmacological manipulation of signalling mechanisms 
used by vasoactive factors and have limited therapeutic benefit. Our 
observations open a new avenue of research into the therapeutic 
potential of ZIP12 inhibition and suppressed excursions of intracellu- 
lar free zinc as a novel strategy for preventing or treating pulmonary 
hypertension. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 
Animals. Inbred Wistar-Kyoto (WKY, Charles River) and Fischer 344 rats (F344, 
Harlan) were used as original strains. Animals were maintained at a constant 
temperature (20-22 °C) with a 12-h on/12-h off light cycle, with food and water 
ad libitum. All experiments were conducted under the project licence PPL70/7425 
in accordance with the UK Home Office Animals (Scientific Procedures) Act 1986. 
To ascertain the pulmonary artery pressure phenotype, male rats (WKY, F344, 
congenic and sub-congenic strains, ZIP12 transgenic rats) aged 10-12 weeks were 
studied in batches, with the parental WKY strain included as an internal control in 
each batch studied. Sample sizes were chosen on the basis of experience of pul- 
monary artery pressure variation in the parental strains. A sample size of at least 
n= 5 per group was predicted to detect a difference in mean pulmonary arterial 
pressure’ of 5 mm Hg (SD = 3) with 95% power with 95% confidence. Additional 
animals were studied to obtain sufficient tissue for supportive analyses. 
Generation of congenic and sub-congenic strains. To investigate the involve- 
ment of the chromosome 17 QTL in the pulmonary hypertension (PH) phenotype, 
we introgressed the F344 chromosome QTL segment into the WKY genetic back- 
ground by repeated backcrossing’. We produced a congenic rat strain, R47A 
(WKY.F344-D17Got91/D17Rat51), which contains 15 Mbp from the F344 donor 
region that maps to the distal end of the QTL on the WKY background. 
Subsequently, we generated three sub-congenic strains: Sub-A (WKY.F344- 
D17Got91/D17Rat47), Sub-B (WKY.F344-D17Rat47/D17Rat51) and Sub-C 
(WKY.F344-D17Rat131/D17Rat51). These three recombination events divided 
the R47A congenic interval into three smaller and overlapping sub-congenic 
intervals (Extended Data Fig. 1). 
Microsatellite genotyping of congenic rats. Congenic and sub-congenic 
rats were genotyped using simple sequence length polymorphism markers 
(Extended Data Table 2). To reduce the unknown regions between the markers, 
rats were also genotyped using primers specifically designed to amplify known 
regions containing insertions or deletions in one of the two parental strains 
(Extended Data Table 2). Genomic DNA was isolated from rat ear clippings using 
hot sodium hydroxide and Tris (HotSHOT) extraction”. Forward primers were 
fluorescently labelled with 6-FAM. PCR products together with the fluorescent 
size marker (ROX 400HD, Applied Biosystems) were diluted in formamide and 
run ona 3730xl DNA Analyzer (Applied Biosystems). Results were analysed using 
GeneMapper version 3.7 software (Applied Biosystems). 
Illumina genome sequencing library preparations. Five micrograms of male 
WKY/Ncrl (two animals) and F344/Ncrl (one animal) rats were used to construct 
paired-end whole-genome libraries with 300-550 bp insert size. Genomic DNA 
was prepared by standard phenol chloroform extraction followed by treatment 
with DNase-free RNase. DNA quality was assessed by spectrophotometry (260/ 
280 and 260/230) and gel electrophoresis before library construction. Genomic 
DNA was sheared for 90s (Covaris $2, KBioscience), using 10% duty cycle, 
5 intensity and 200 cycles per burst. The shearing efficiency was assessed by 
Qubit 2.0 fluorometer measurements (Life Technologies) and gel electrophoresis. 
The library was prepared as recommended (Illumina Genomic DNA sample 
prep kit protocol) with nine cycles of PCR amplification (Illumina). Constructed 
libraries were assessed with an Agilent 2100 bioanalyser using an HS DNA assay 
(Agilent Technologies) and quantified using a KAPA Illumina SYBR Universal 
Lib QPCR kit (Anachem). The resulting libraries were sequenced on an Ilumina 
HiSeq2000 following the manufacturer’s instructions. Polymorphisms were con- 
firmed by capillary sequencing. 
Generation and genotyping of WKY.SIc39a12*’~ and $Ic39a12~'~ rats. 
CompoZr Custom Zinc Finger Nucleases targeting the rat Slc39a12 gene were 
designed and purchased from Sigma-Aldrich (Extended Data Fig. 3). Pronuclei 
from fertilized WKY oocytes were microinjected with ZFN mRNA (2 ng” '). 
Three out of eleven pups were positive for Slc39a12 mutations, as revealed by Cel-I 
surveyor assay and gene sequencing'’. One pup (mutant 77) hosted a stop codon 
15 amino acids from the ZFN-binding site, resulting in a truncated protein of 
490 amino acids (54kDa), 198 amino acids smaller than the WT protein, and 
introduced a sequence coding for 5'-ATTTAAAT-3’, a binding site for the Swal 
restriction enzyme. Mutant 77 was selected as a founder to mate with a WKY 
female. Pups were genotyped by amplifying DNA and digesting with Swal. The 
primers used to amplify the region of interest were forward 5’-GCAATG- 
GTTTTCCACAGTGA-3’ and reverse 5'-GCGCACTGAGGCTTTAAGAA-3’. 
Pulmonary hypertension phenotyping. Pulmonary hypertension was induced 
by placing animals in a normobaric hypoxic chamber (FIO, = 10%) for 2 weeks or 
by subcutaneous injection of monocrotaline (60 mgkg ’; Sigma-Aldrich). The 
experiments were not randomized. All studies were performed using the same 
equipment and all haemodynamic measurements were made by the same two 
operators. The operators were blinded to the genetic status of animals, which was 
confirmed after phenotyping. All histological assessments were made by two 
observers blinded to genetic status of the animals. At the end of each experimental 
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period, animals were weighed and anaesthetized (Hypnorm 1 ml kg” ' intramus- 
cularly; Mydazolam 0.8 ml kg”! intraperitoneally). Pulmonary arterial pressure 
was measured with a pre-curved catheter inserted through the right jugular vein. 
Systemic blood pressure was assessed via carotid artery cannulation. Cardiac out- 
put was measured by thermodilution. PVR was calculated using the standard 
equation, PVR = mean pulmonary artery pressure/cardiac output. All data were 
recorded with a PowerLab Data Acquisition system (AD Instruments) and ana- 
lysed using LabChart 7 software. 

The animals were then killed and the heart dissected, and individual chamber 
weights recorded. The ratio of right ventricle to left ventricle plus the septum mass 
was calculated as the right ventricle hypertrophy index. Some collected tissues 
were snap frozen in liquid nitrogen and stored at —80 °C for further biochemical 
measurements. The left lung was fixed by inflation with 10% formalin in phos- 
phate-buffered saline, embedded in paraffin and sectioned for histology. 
Transverse rat lung sections were processed for elastic van Gieson staining. 
Peripheral vessels less than 100 1m in diameter were counted at X40 magnifica- 
tion objective, and pulmonary vascular remodelling was expressed as the propor- 
tion of vessels with double elastic lamina (>75% of the circumference as fully 
muscularized, 25-75% as partly muscularized) to total vessels counted. Counting 
was performed twice by observers blinded to treatment. 

Ex vivo angiogenesis assay of pulmonary arteriole. Angiogenesis assay of arter- 
ial rings was performed as previously described’*. Pulmonary arterioles (first and 
second order) were dissected from rat lungs viewed under the microscope. One 
millimetre sections were placed in matrigel (50 ul per well) in a 96-well plate, 
allowed to gel for 30 min at room temperature (20°C), then incubated for up to 
6days with endothelial cell culture medium MV2 with 5% fetal calf serum 
(PromoCell). On days 3, 4, 5 and 6, the length of the longest sprouts was measured 
under the microscope (X 4 objective). On day 6, fluorescent images of the arteriole 
ring were taken by staining the tissue with calcein (Invitrogen) for 15 min at 37 °C. 
Anti-ZIP12 antibody production. An antibody raised against the last five amino 
acids at the C terminus of both the human and the rat ZIP12 protein was produced 
in rabbits following previous methodology”®. Rabbits were immunized with syn- 
thetic peptides conjugated to keyhole limpet haemocyanin ([CYS(KLH)]QNIKI). 
Peptide sequence was confirmed to be ZIP12 specific using RStudio. Immunized 
rabbit serum containing anti-ZIP12 antibody specificity was confirmed by immuno- 
blotting with rat lung lysates or human pulmonary smooth muscle cells. A single 
band at about 70 kDa was visible in the immunoblots. 

Lung immunohistochemistry and immunofluorescence. Human IPAH and 
control lung samples were obtained from the Imperial College pulmonary hyper- 
tension biorepository (ethics reference numbers 01-210 and 2001/6003) where 
samples are deposited following informed patient consent. Anonymized Kyrgyz 
high-altitude lung samples were obtained from post-mortem lung following 
approval of the local ethics committee (reference 02-23/880). 

Lung sections were immunostained with rabbit anti-ZIP12 (1:1,000), Ki67 
(1:50; Thermo Scientific) and rabbit anti-CAIX (1:100) antibodies. For immuno- 
histochemistry, horseradish-peroxidase-conjugated secondary anti-rabbit anti- 
body (1:200) was used. Double immunofluorescence with anti-~SMA (1:100) 
was performed using fluorescence secondary antibodies, anti-mouse Alexa 488 
and anti-rabbit Alexa 568 (1:2,000, Invitrogen). Images (green for ZIP12 and red 
for «SMA) were detected under Leica confocal microscope (TCS SP2 AOBS). 
HPASMC culture. HPASMCs from PromoCell and Lonza were grown in human 
smooth muscle cell growth medium 2 (PromoCell). All commercial cell lines 
were tested for mycoplasma contamination. The cells were cultured under normal 
oxygen tension (20% O32, 5% COz) or exposed to hypoxia (2% O, 5% CO2, 92% N2) 
for 48-72h. A bromodeoxyuridine (BrdU) cell proliferation assay (Millipore) 
was used to assess cell proliferation following the manufacturer’s conditions. 
ZIP12 siRNA transfection. Cells were transfected overnight with 50 pmol siRNA 
against ZIP 12 (s8397, Ambion), or negative siRNA (4390844, Ambion) as a con- 
trol, using Lipofectamine RNAiMAX (Invitrogen Life Technologies) according to 
the manufacturer’s conditions. 

Quantification of actin fibre formation. Cells were cultured on plastic coverslips 
(Nunc), transfected with scrambled or ZIP12 siRNA and exposed to hypoxia 
as described previously. After exposure for 48h, cells were fixed with 4% form- 
aldehyde solution in phosphate buffered saline for 10 min at room temperature. 
Cells were then incubated with Alexa-568-conjugated phalloidin (1/200; 
Invitrogen) for F-actin detection by confocal microscopy. Sequential xyz-sections 
(approximately 12 sections of 1 1m? per view) were obtained and three-dimen- 
sional images were reconstructed. Quantification of actin stress fibres was deter- 
mined by volume rendering in Image-J. Actin volume per cell was expressed as the 
fold increase from normoxic control (value set at 1). 

Quantification of zinc concentration by FRET measurement. Cells on cover- 
slips were washed twice in Krebs-HEPES-bicarbonate (KHB) buffer (140 mM 
NaCl, 3.6mM KCl, 0.5mM NaH>POu,, 0.2mM MgSOg,, 1.5mM CaCh, 10mM 
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HEPES, 25 mM NaHCO), which was warmed, bubbled with 95% O,/5% CO,, set 
to pH 7.4 and contained 3 mM glucose. Imaging of zinc using eCALWY sensors 
was performed as optimized before'®”’. Briefly, cells were maintained at 37°C 
throughout with a heating stage (MC60, LINKAM, Scientific Instruments), 
and KHB buffer was perfused (1.5-2 ml min ') with additions as stated in the 
figures. Images were captured at 433 nm monochromatic excitation wavelength 
(Polychrome IV, Till Photonics) using an Olympus IX-70 wide-field microscope 
with a X40/1.35 numerical aperture oil immersion objective and a Zyla sCMOS 
camera (Andor Technology) controlled by Micromanager software”**. Acquisition 
rate was 20 images per minute. Emitted light was split and filtered by a Dual- View 
beam splitter (Photometrics) equipped with a 505dcxn dichroic mirror and 
two emission filters (Chroma Technology D470/24 for cerulean and D535/30 
for citrine). 

Image analysis used Image] software” with a homemade macro, and the fluor- 

escence emission ratios were derived after subtracting background. Steady-state 
fluorescence intensity ratio of citrine/cerulean (R) was measured, then maximum 
and minimum ratios were determined to calculate free Zn’ * concentrations using 
the following formula: [Zn?*] = Ka X (Rmax — R)/(R — Rmin). The maximum 
ratio (Rmax) Was obtained upon intracellular zinc chelation with 50 1M TPEN 
and the minimum ratio (R,,in) was obtained upon zinc saturation with 100 1M 
ZnCl, in the presence of the Zn** ionophore pyrithione (5 1M) (ref. 10). 
HIF-motif analysis and cloning. HOMER’ was used to scan for HIF-1o and HIF- 
2a recognition motifs in the region 2 kb upstream and 1.5 kb downstream of the 
ZIP12 transcription start site. Results with a HOMER score <6.5 were discarded. 
A 5’ region of ZIP12 gene containing these motif (HRE) (human (hg19) chro- 
mosome 10: positions 18,240,587-18,242,100) was cloned into the multicloning 
site of pGL4.10, which encodes the luciferase reporter gene luc2, by Gibson 
Assembly (NEB, E2611S). Three nucleotide substitutions in the core of the pre- 
dicted HIF1/2«-binding site motif were created by site-mutagenic PCR to produce 
a disabling mutant (Fig. 2d). 
Transfection and luciferase assay. HPASMCs were seeded in 24-well plates at 
70-80% confluence. Cells were transfected with 300 ng of each plasmid together 
with 2 ng of Renilla plasmid using Lipofectamine 2000 (Life Technologies), exposed 
to hypoxia and lysed according to the manufacturer’s conditions. Luciferase activity 
was measured using Dual-Luciferase Reporter Assay Chemistry (Promega) as 
previously described”. Experiments were repeated in two cells lines, n = 6 per line. 
ChIP and PCR. Specific protein-DNA interactions were examined by ChIP fol- 
lowed by qPCR (Chromatin Immunoprecipitation Assay Kit, Millipore). Protein- 
DNA crosslinks were achieved by fixation with 1% formaldehyde for 10 min at 
room temperature. DNA-protein complexes from 2 X 10° cells were sheared to 
lengths between 200 and 500 base pairs by sonicator (Bioruptor). The pre-cleared 
fragments were incubated with 101g of HIF-1« or HIF-20 specific antibody 
(Novus Biologicals), or without antibody (as a negative control) overnight, fol- 
lowed by immunoprecipitation by Protein A Agarose/Salmon Sperm DNA (50% 
Slurry). The crosslinks were reversed by heating at 65 °C overnight, followed by 
Proteinase K digestion at 45°C for 2h. DNA was then recovered with QIAquick 
PCR purification kit (Qiagen) for qPCR to prove affinity against ZIP12 promoter 
region (Fig. 2d). Experiments were conducted in two separate cell lines (n = 3 
each) and gave the same result. 

Quantitative PCR was performed as previously described in the methodology, 
using 1p of DNA sample, and using the forward primer 5-TTTCCCAAC 
CTGGGTCCTAT-3 and the reverse primer 5-AGCAGCCAAAAAGCTTGC 
TA-3. C terminus values were normalized compared with the values detected in 
the starting non-inmunoprecipitated DNA sample (input). Protein-DNA affinity 


was confirmed when normalized C terminus values were above the basal levels 
measured in the negative control. 

Quantitative PCR with reverse transcription. RNA was extracted from lungs 
using RNeasy Mini Kit (Qiagen). PCR was performed with an ABI 7500 Sequence 
Detection System (Applied Biosystems). Quantitative PCR was performed using a 
two-step protocol starting with complementary DNA (cDNA) synthesis using the 
ImProm-II Reverse Transcription System (Promega), followed by PCR using the 
Power SYBR Green PCR Master Mix (Applied Biosystems). A total of 100 ng of 
cDNA per sample was used. All samples were amplified using biological triplicates 
with two technical replicates per sample. The 7500 Sequence Detection System 
software (Applied Biosystems) was used to obtain Cy values. Results were analysed 
using the comparative Cy method*'. Samples were normalized to a reference gene, 
Ubc (for rat samples) or Cyclophilin (for human cell samples), to account for 
cDNA loading differences. 

Western blot. Frozen rat tissues (lungs) and cell pellets were homogenized in 
RIPA buffer (50 mM tris-HCl, pH 8.0, 150 mM sodium chloride, 1.0% Igepal, 0.5% 
sodium deokycholate, 0.1% sodium dodecyl sulphate) (Sigma) supplemented with 
protease inhibitor (Roche). Western blotting used Mini-PROTEAN TGX Precast 
Gels (Bio-rad) following the manufacturer’s suggestions. Blots were incubated for 
1h at room temperature with Anti-ZIP12 (1:10,000); Anti-HIF1o (1:1,000, Novus 
Biological) or Anti- HIF2« (1:1,000, Novus Biological). Proteins were detected by 
Clarity western ECL substrate (Bio-rad). Optical densities of individual bands 
were measured using Image] software and protein expressions were standardized 
with vinculin. 

Statistical analysis. Data are presented as mean + s.e.m. Data were tested for 
normality using a Kolmogorov-Smirnov test. All data were confirmed as normally 
distributed with similar variance between comparator groups. Data were analysed 
using one-way ANOVA followed by Bonferroni post-test adjustment for multiple 
comparisons or unpaired t-test. Graphpad Prism was used for all statistical analysis. 
Other bioinformatics analyses. The Ensembl database” was mined with the 
BioMart tool* to identify all transcribed elements in the confidence interval 
region. The search was limited to chromosome 17 between positions 85,072,475 
and 93,347,758. PolyPhen analysis was used to predict the possible impact of 
described SNP on amino-acid substitution on the structure and function of a 
human protein™*. 
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Extended Data Figure 1 | Generation of congenic and sub-congenic strains. 
Congenic rat lines were produced by introgression of the F344 chromosome 17 
QTL segment onto the WKY genetic background by repeated backcrossing. 
Congenic rat strain R47A (WKY.F344-D17Got91/D17Rat51) contains 15 Mbp 
from the F344 donor region that maps to the distal end of the QTL on a WKY 
background. Three sub-congenic strains, SubA (WKY.F344-D17Got91/ 


D17Rat47), SubB (WKY.F344-D17Rat47/D17Rat51) and SubC (WKY.F344- 
D17Rat131/D17Rat51), were produced containing separate fragments of the 
R47A donor region by backcrossing (R47A X WKY) F, with WKY parental 
rats. Three recombination events within the R47A congenic interval divided the 
congenic interval into three smaller and overlapping sub-congenic intervals 
(Fig. 1 and main text). 
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Extended Data Figure 2 | Dissection of QTL and cardiovascular phenotype 
of rat strains. a, An illustrative genetic map showing the relationship of the 
congenic strains (R42, R47A), subcongenic strains (SubA, SubB, SubC) and 
Slc39a12 to the original QTL (defined by a LOD score >3 (ref. 2)) on a physical 
map of chromosome 17 (using Rat Genome Assembly version 5.0). b, The 
hypoxia-resistant F344 phenotype tracks with the congenic R47A line. Rats 
were kept in 10% O, for 2 weeks and right ventricular hypertrophy was 
significantly attenuated in the congenic R47A strain (0.32 + 0.03, n = 13, 
**P < 0.01) compared with WKY rats (0.37 + 0.03, n = 15), whereas congenic 
R42 rats (0.36 + 0.03, n = 17) were similar (NS) to WKY rats. In normoxia, 
WKY, F344, R47A, SubA, SubB and SubC rats show no significant differences 


(©) Non-muscularized 
@) Partially muscularized 
@® Fully muscularized 


% Muscularization 


in (c) mPAP, (d) right ventricular hypertrophy and (e) vascular 
muscularization (n = 8 each group); f, systemic blood pressure (SBP) is similar 
in all strains in both normoxia and hypoxic conditions. g, F344, R47A and 
SubB rats exhibit attenuated pulmonary vascular remodelling after 2 weeks 
exposure to a 10% O, atmosphere compared with WKY, SubA and SubC rats 
(n = 6 each group). Values are expressed as the mean + s.e.m. *P< 0.05, 
**P << 0.01, ***P < 0.001 compared with WKY (percentage of fully 
muscularized and partly muscularized vessels); ##P < 0.01, ###P < 0.001 
compared with WKY (percentage of non-muscularized vessels) after one-way 
ANOVA followed by Bonferroni correction for multiple testing. 
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Extended Data Figure 3 | Hypoxia-induced pulmonary vascular 


remodelling in parental strains. a, Upper panel sequence shows the WKY 


protein sequence (688aa); lower panel shows the truncated F344 protein 


sequence (553aa). Stars (*) mark the mutated amino acids compared with 
WKY protein. Dotted line indicates the C-terminal truncated region in F344. 


50um 


The grey square highlights the metalloprotease motif. b, Prominent ZIP12 
immunostaining is seen in remodelled pulmonary arterioles in the chronically 
hypoxic WKY rat alongside vessels with a double elastic lamina (stained 

with elastic Van Gieson) but not F344 lungs exposed to hypoxia. Red arrow, 
vessel with double elastic lamina; blue arrow, vessel with single elastic lamina. 
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Extended Data Figure 4 | ZIP12 upregulation in response to hypoxia 
exposure and measurements of intracellular labile zinc concentration and 
proliferation of HPASMCs in normoxic conditions. a, Upregulation of 
ZIP 12 in HPASMCs exposed to hypoxia, in contrast to other zinc transporters 
(n = 6). b, Representative western blots demonstrating increased HIF-1o. and 
HIF-2« expression in HPASMCs after exposure to hypoxia for 24h. c, Confocal 
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laser scanning images of HPASMCs transfected with eCALWY-4 probe. 
Intracellular free zinc was not affected by transfection with ZIP12 siRNA in 
normoxia. d, Representative traces showing the changes in fluorescence ratio 
using the eCALWY-4 probe. e, Quantification of intracellular zinc levels 

(n = 10). f, ZIP12 siRNA did not affect proliferation of HPASMCs in normoxic 
conditions (n = 5). 
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Extended Data Figure 5 | Design of specific Slc39a12 ZFN and confirmation _ insertion (TA) into mutant 77 caused a frame-shift in coding, introducing a 
of mutant line. a, CompoZr Custom Zinc Finger Nucleases (Sigma-Aldrich) stop codon leading to a truncated protein. Red star refers to stop codon. c, We 
for the rat Slc39a12 gene were designed to target exon 8. b—d, Cel-I subsequently genotyped next generation litters using Swal (cutting point: 
surveyor assay and gene sequencing confirmed NHEJ-induced mutations in 5'-ATTTAAAT-3’), showing 100% digestion for homozygous pups (—/—), 
at least one pup (mutant 77). e, The 4-bp (AGTT) deletion followed by 2-bp 50% for heterozygous (+/—) and no DNA digestion for WT rats (+/+). 
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Extended Data Figure 6 | ZIP12 knockout attenuated hypoxia-induced 
pulmonary vascular remodelling. a, Representative lung sections from WT 
and ZIP12 ‘~ rats 2 weeks after hypoxia exposure. Elastic van Gieson staining 
showing double elastic lamina (red arrow) in WT but single elastic laminae 
(blue arrow) in ZIP12/~ rats. b, Genetic disruption of ZIP12 in WKY rat 
attenuated pulmonary vascular remodelling after exposure for 2 weeks toa 10% 
O, atmosphere compared with WT rats (n = 5 each group). *P<0.01 
compared with WT (percentage of fully muscularized vessels); ##P < 0.01, 
###P < 0.001 compared with WT (percentage of non-muscularized vessels) 
after one-way ANOVA followed by Bonferroni’s multiple comparison test. 

c, Ki67 staining showing reduced proliferation in hypoxic ZIP12~/~ rat lungs 
compared with the WT strain. *P < 0.01 compared with WT. d, Representative 
sections from hypoxic WT and ZIP12 ‘~ rats lungs showing differences in 
staining with the proliferation marker, Ki67. e-g, Genetic disruption of ZIP12 
in WKY rat did not influence (e) systemic blood pressure (SBP) or (f) cardiac 


LETTER 


output (CO) but attenuated hypoxia-induced increases in (g) PVR (n = 7 each 
group). Values are expressed as the mean = s.e.m. *P < 0.05, **P < 0.01 
compared with normoxic rats, #P < 0.05 compared with WT hypoxic rats after 
one-way ANOVA followed by Bonferroni correction for multiple testing. 

h, ZIP12-targeted siRNA inhibition attenuates stress fibre formation in 
HPASMCs in hypoxia (n = 5 each group). **P < 0.01 compared with 
normoxia control group, #P < 0.05 compared with hypoxia control group. 

i, Representative pictures of actin stress fibre in HPASMCs. j, Ex vivo 
angiogenesis studies demonstrated that vascular outgrowth from ZIP12 /~ 
pulmonary vessels in response to hypoxia was attenuated compared with 
vessels from WT rats (n = 12 each group, 2 rings per rat, 6 ZIP12~/~ and6 WT 
rats). *P << 0.05 compared with normoxia WT group; #P < 0.05, ##P < 0.01 
and ###P < 0.001 compared with hypoxia ZIP12-’~ group. k, Representative 
pictures of pulmonary arteriole ring outgrowth at day 6. 
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Extended Data Figure 7 | Carbonic anhydrase (CAIX) expression. low-altitude (normoxia control, CO calf) calves and sea-level humans, but 
a, Representative sections demonstrating increased CAIX expression in prominent CAIX immunostaining was observed in the remodelled pulmonary 
remodelled pulmonary arterioles in the lungs of rats exposed to alveolar arteries of calves with severe pulmonary hypertension (Hx calf), in cattle 


hypoxia (2 weeks), monocrotaline (MCT, 3 weeks) or a chronic iron-deficient — with naturally occurring pulmonary hypertension (‘Brisket disease’, BD) 
diet (4 weeks). b, c, No CAIX staining was detected in pulmonary arteries of as well as patients with IPAH. 
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Extended Data Figure 8 | Genetic disruption of ZIP12 in WKY rat t-test. d, Representative lung sections from WT and ZIP12~/~ rats 3 weeks 
attenuated monocrotaline-induced pulmonary hypertension. a, mPAP, after monocrotaline injection. Elastic van Gieson staining showing double 
(b) right ventricular hypertrophy and (c) pulmonary arteriole muscularization __ elastic lamina (red arrow) in WT but single elastic laminae (blue arrow) 
(n = 5 each group). Values are expressed as the mean + s.e.m. *P < 0.05, in ZIP12 ‘~ rats. 


**P < 0.01 compared with WT monocrotaline group after unpaired Student’s 
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Frameshift and non-synonymous coding mutations in the refined congenic interval of F344 and the other hypoxia- 
susceptible strains, WKY, spontaneously hypertensive (SHR) and fawn-hooded (FHH) rat strains 


Position Strain 
5 Reference Alternate 9 & 4 z rf S28 _ Amino 

Gene Name  ENSEMBLgeneID —ENSEMBLtranscript ID SNP/IndelID_— Chromosome. fay “me Me SS azs z g = cman cDNA @s Protein Codon Potyphen 

26522 = g z Position position position Change Prediction 

fSEEeas eB jange 
Cdnf ENSRNOG00000026493 ENSRNOT00000036299 17_85839161_G/A v7 85,839,161 = G A 2 0 62000 NON_SYNONYMOUS_CODING 275 25 72 AN  gCg/eTe Benign 
Hspai4 ENSRNO©GO00000015196 ENSRNOT00000020384 17_85852829 G/C v7 85,852,829 5 G < 2 0 oooo NON_SYNONYMOUS_CODING 478 418 160 D/H Gat/Cat_ Probably damaging 
Hspal4 ENSRNOGO0000015196 ENSRNOT00000020384 17_85853753_A/G 7 85,853,753 - A G 2}|2]0 ooaooo NON_SYNONYMOUS_CODING B44 a4 282 T/A Act/Gct Benign 
Hspal4 ENSRNOG00000015196 ENSRNOTO0000020384 17_85853981_A/G v $5,853.98 - A « [2]2]o 0600 NON_SYNONYMOUS_CODING 1072 1072 358 = N/D Aat/Gat Benign 
Detreic ENSRNOG00000015980 ENSRNOTO0000021506 rs8173830 v7 85,896,820 - G6 A 2 o ooaooo NON_SYNONYMOUS_CODING 1922 1867 623 P/sS Cet/Tet Benign 
Delreic ENSRNOGO0000015980 ENSRNOTO0000021771 138173830 v7 85,896,829 G Ae 0 0000 NON_SYNONYMOUS_CODING 1922 1867 623 © P/S Cet /Tet Benign 
D4A4W9_RAT ENSRNOG00000038045 ENSRNOTOO000057895 17_85993589_G/A v7 85,993,589 - G a [2jol2 2]}0 0 0 NON_SYNONYMOUS_CODING 172 170 57 R/HcGt/cAt Benign 
F1M6A7_RAT ENSRNOG00000038037 ENSRNOTO0000057886 17_86014636_TGTA/- v7 86,014,636 86,014,639 TGTA = 2/0/2 2/0 0 0 FRAMESHIFT_CODING 1063-10665 1063-1066 355-356 = = cd 
FAM6A7_RAT ENSRNOG00000038037 ENSRNOTO0000057885 17_86014636_TGTA/- v 85,014,636 86,014,639 TGTA - 2}o{2 2/0 00 FRAMESHIFT_CODING 901-904 901-904 301-302 - - - 
FIM6A7_RAT ENSRNOG00000038037 ENSRNOTOO000057886 17_86014638_TA/- v7 86,014,638 86,014,639 TA * 2/0/2 2/0 00 FRAMESHIFT_CODING 1063-1063 1063-1064 355 = bd - 
F1IM6A7_RAT ENSRNOG00000038037 ENSRNOTO0000057885 17_86014638_TA/- v7 86,014,638 86,014,639 TA 2 2])0/2 2) 0.6. @ FRAMESHIFT_CODING 901-902 901-902 301 = = = 
FAM6A7_RAT ENSRNOG00000038037 ENSRNOTO0000057886 17_86017004_G/C v7 86,017,004 = G c 2}0/2 2/0 0 o NON_SYNONYMOUS_CODING 1007 1007 336 P/R cCa/cGa Unknown 
FAM6A7_RAT ENSRNOG00000038037 ENSRNOTO0000057886 17_86017414 C/A 7 (017,414 - c A [2jol2 2/0 0 0 NON_SYNONYMOUS_CODING 597 27 199 M/l_—aG/atT Unknown 
FIM6A7_RAT ENSRNOG00000038037 ENSRNOTOO000057885 17_86017414 C/A 7 (86,017,414 * c A 2}0/2 2/0 0 0 NON_SYNONYMOUS_CODING 513 513 71 M/i aG/atT Unknown 
FAM6A7_RAT ENSRNOG00000038037 ENSRNOTO0000057886 17_86017921_-/GT v7 86,017,921 865,017,922 - et [o]2[o ole FRAMESHIFT_CODING 104-105 10410535 - - - 
FIM6A7_RAT ENSRNOG00000038037 ENSRNOTO0000057885 17_86017921_-/GT a 86,017,921 86,017,922 - et [ol2[o Oo] 2 2 2 FRAMESHIFT_CODING 67 6-7 23 - - - 
Nmt2 ENSRNOGO0000026248 ENSRNOT00000030228  17_86060033_C/T v7 86,060,033 = c T Ale oO ooo0oo0 NON_SYNONYMOUS_CODING 134 3B4 45 S/N aGt/aAt Benign 
Nmt2 ENSRNOG00000026248 ENSRNOTOOO00065609 17_86080154_-/G vv 86,080,154 86,080,155 G im = 2 o| 25.2 FRAMESHIFT_CODING 86-87 86-87 29 - - - 
Cubn ENSRN©G00000029047 ENSRNOT00000040052 17_87738747_G/A 7 87,738,747 - 6 A 2 r) noe mome NON_SYNONYMOUS_CODING 1310 1310 437 S/L  tcg/tTg Benign 
St8sia6 ENSRNOG00000018171 ENSRNOTO0000024475 _17_88008467_G/A v7 88,003,467 - G A 2 0 oooo NON_SYNONYMOUS_CODING 230 20 7 Tl = xXt/ate Benign 
$139a12 ENSRNO©G00000025639 ENSRNOT00000044313 17_88525807_C/A v7 88,525,807 - c A 0 2 2 2 2 2 NON_SYNONYMOUS CODING SPLICE STE 539 539 180 P/Q cCg/cAg Benign 
$139a12 ENSRNOG00000025539 ENSRNOTO0000056690 17_88525807_C/A 7 88,525,807 - ic A 0 2 2 2 2 2 NON_SYNONYMOUS_CODINGSPLICESTTE 539 539 180 P/Q = cCg/cAg Benign 
$1G9a12 ENSRNOG00000025539 ENSRNOT00000044313  17_88535671_T/G v7 88,535,671 ee T G 0 2 yi ae | NON_SYNONYMOUS_CODING 910 10 304 wW/G Tee/Geg Benign 
‘$1c39a12 ENSRNOG00000025639 ENSRNOTO0000066690 17_88535671_T/G vv 88,535,671 - bl G 0 2 22 2 2 NON_SYNONYMOUS_CODING 910 s10 304 W/G Tge/Ggg Probably damaging 
8139212 ENSRNOG00000025639 ENSRNOT00000044313 _17_88575534_T/- 7 88,575,534 8,575,534 9 T - 2 0 oooo0 FRAMESHIFT_CODING 1628 1628 43 E - Truncated Protein 
S139a12___ ENSRN.©G00000025639 ENSRNOT00000066690__17_88575534_T/- v7 88,575,534 88,575,534 2 o oooo FRAMESHIFT_CODING 1520 1520 507 : -__ Truncated Protein 
Genotype code: “andidate genes, Blue =most important polymorphism 


©2015 Macmillan Publishers Limited. All 


ghts reserved 


Extended Data Table 2 | Polymorphism markers for congenic strain genotyping 
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Rat chr 17 Genetic map Primers Expected size (bp) Genotyping information 
Marker SHRSP x BN Physical map possition FORWARD REVERSE WKY F344 R47A SubA SubB SubC 
D17Rat41 38.27 80234337 - 80234511 CCTTTCCCTTTCCACTCTCC GGTAAGGGTGGTGGCAGTAG 171 157 ww WW WW WW 
D17Rat44 40.89 81612362 - 81612488 CAGACAAAACCCCAGCATTT AGCAGAAAGAACCAGGCAGA 133 121 ww WW WW WW 
D17Got91 82277435 - 82277611 CCAGACACCAACATCACACC CCTTCATGTGTGGAGTGTTTATG 160 176 FF FF ww WW 
D17Rat43 40.89 82337230 - 82337376 CACTCACTTGCTGGCTGTCT GAGAAGAAGCTGGAGAGGCA 150 124 FF FF ww WW 
D17Rat42 40.33 82505422 - 82505562 TGCCGCTATTAAAAAGTAACTGC CCAAAGGCATAAAAATCTTTCC 141 121 FF FF ww WW 
D17Rat62 42.35 83479938 - 83480058 GAAAGGATGGCAGGTTTTTG TCCACAGGCTCACTGTCACT 143 ali FF FF ww WW 
D17Rat46 42.33 83600152 - 83600282 TGGGTTCTTTCATTCCTTGC GCTCACCCACACACACATTC 135 125 FF FF ww WW 
D17Rat47 43.34 85072353 - 85072475 CCCTGCTTTCTGCTTTGAAC TGCATATACGAATTACAGCTCAA 114 126 FF FF FF Ww 
Del85862103 85862060 - 85862261 CACCATGAGCTCAGCAGTGT ACACCGTCTGGCTCTCTCAG 203 193 FF WW FF Ww 
In85923365 85923262- 85923495 ACCTTTGGCTCGGTCCTATC AAACTTGGGTACCAGCACCA 235 243 FF WW BE Ww 
D17Got93 86032700 - 86032904 CACTACACCTCCCAACGTCC CTGTTGTGCCTCCTGACTAATG 227 215 FF WW FF Ww 
D17Mits 45.19 87465135 - 87465345 GGTCGGCATTATGGCTAAGA CTATAGCCTCTAGGGAGGGG 193 195 FF WW FF WW 
D17Rat60 45.19 88268817 - 88269054 GGGGTCCAGCACTTAGCAT GTTTTGATCATGGGGACGTT 239 241 FF WW FF Ww 
D17Rat48 45.19 88667790 - 88667952 CACATGTCTAACTTGCCACATACA TITGCTGTTTCTTGTTCATGTG 166 156 FF WW FF WW 
Del89391756 89391713 - 89391951 TCCATGTTTTATCACCGGAAG ATCTGATGCATGCCATAGCC 230 238 FF WW FF Ww 
In90455808 90455697 - 90455921 AAGTTAGCCTTCCCCAAGGA TCTGGTCTTTCCCATGTTCC 233 225 FF WW FF WW 
D17Rat131 47.54 93347784 - 933447990 TTAAGAAGGGCAAGCAAGGA TCCCCATAAAAAGAAAAGGAA 203 213 FF WW FF FF 
D17Rat51 47.54 96587775 - 96587905 TCCCACTGGTCAATCCATTT ACATGCAGACAGAACATT CCT 144 148 FF WW FF FF 
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Transcriptional control of autophagy-lysosome 
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Activation of cellular stress response pathways to maintain meta- 
bolic homeostasis is emerging as a critical growth and survival 
mechanism in many cancers’. The pathogenesis of pancreatic duc- 
tal adenocarcinoma (PDA) requires high levels of autophagy”*, a 
conserved self-degradative process’. However, the regulatory cir- 
cuits that activate autophagy and reprogram PDA cell metabolism 
are unknown. Here we show that autophagy induction in PDA 
occurs as part of a broader transcriptional program that coordi- 
nates activation of lysosome biogenesis and function, and nutrient 
scavenging, mediated by the MiT/TFE family of transcription fac- 
tors. In human PDA cells, the MiT/TFE proteins°—MITF, TFE3 
and TFEB—are decoupled from regulatory mechanisms that con- 
trol their cytoplasmic retention. Increased nuclear import in turn 
drives the expression of a coherent network of genes that induce 
high levels of lysosomal catabolic function essential for PDA 
growth. Unbiased global metabolite profiling reveals that MiT/ 
TFE-dependent autophagy-lysosome activation is specifically 
required to maintain intracellular amino acid pools. These results 
identify the MiT/TFE proteins as master regulators of metabolic 
reprogramming in pancreatic cancer and demonstrate that tran- 
scriptional activation of clearance pathways converging on the 
lysosome is a novel hallmark of aggressive malignancy. 

Autophagy delivers cargo to lysosomes for degradation, suggesting 
the possibility that these systems may be coordinately regulated in 
PDA. Immunostaining for LC3 and LAMP2 revealed significant 
expansion of both organelles in PDA cell lines compared with non- 
transformed human pancreatic ductal epithelial cells (HPDEs) (Fig. la 
and Extended Data Fig. 1a). Notably, transmission electron micro- 
scopy demonstrated an increase in lysosome number per cell in treat- 
ment-naive PDA specimens relative to normal pancreatic tissue 
(12.6 + 4.3 versus 1.0 + 0.9; Fig. 1b). Thus, increased lysosomal bio- 
genesis accompanies the expanded autophagosome compartment in 
PDA and may facilitate high levels of autophagic flux. Consistent with 
transcriptional control of these organellar changes, gene set enrich- 
ment analysis (GSEA) of multiple independent data sets revealed that 
human PDA specimens have elevated expression of autophagy-lyso- 
some genes compared with normal pancreatic tissue (Fig. 1c, Extended 
Data Fig. 1b and Supplementary Tables 1, 2). Accordingly, immuno- 
histochemistry confirmed upregulation of autophagy-lysosome pro- 
teins in the tumour epithelium (Fig. 1d). 

In normal cells exposed to nutrient stress, the biogenesis of both 
organelles is under transcriptional regulation by the MiT/TFE sub- 
class of basic helix-loop-helix transcription factors®’. RNA- 
sequencing (RNA-seq) data across ten common solid tumour types 
revealed high relative expression of these factors in PDA, with levels 
only exceeded in melanoma and kidney cancers, in which MiT/TFE 
transcription factors are established oncogenes (Extended Data 


Fig. 1c). Immunohistochemistry demonstrated overexpression of 
nuclear-localized TFE3 in the neoplastic epithelium in a subset of 
PDA (staining scores =2 in 23% of PDA versus 3% of normal pan- 
creas specimens; P< 0.001) (Fig. le and Extended Data Fig. 1d; see 
Methods). Similarly, microdissected specimens and xenografts 
exhibited frequent upregulation of TFEB and MITF messenger 
RNA in PDA cells relative to normal ductal epithelium, and subsets 
of PDA cell lines showed MiT/TFE overexpression compared with 
HPDE cells (Extended Data Fig. le-h). Generally, a single MiT/TFE 
family member predominated in individual specimens. 

GSEA of human primary PDA data sets and cultured PDA cell lines 
showed strong correlation between expression of MiT/TFE factors and 
the autophagy-lysosome signature (Fig. 1f and Extended Data 
Fig. li-k). Accordingly, knockdown of TFE3 in the 8988T PDA cell 
line (TFE3-high, TFEB/MITF-low) resulted in prominent repression 
of this signature (global RNA-seq; Fig. 1g, h). Chromatin immunopre- 
cipitation (ChIP) confirmed that MITF and TFE3 bound to multiple 
autophagy and lysosome genes bearing a consensus coordinated lyso- 
somal expression and regulation (CLEAR) element”* in PDA cells 
(Extended Data Fig. 2a). Moreover, knockdown of MITF, TFE3 or 
TFEB caused downregulation of numerous CLEAR-bearing genes in 
a series of PDA cell lines with high relative expression of that MiT/TFE 
family member, whereas no significant changes were seen in non- 
transformed pancreatic lines (HPDE and HPNE) or in a pancreatic 
neuroendocrine tumour cell line (QGP1) (Extended Data Fig. 2b-d). 
Expression of RNA interference (RNAi)-resistant MITF or TFE3 com- 
plementary DNA restored target gene expression whereas dominant- 
negative MITF recapitulated the effects seen with RNAi (Extended 
Data Fig. 2e, f). Thus, MiT/TFE proteins act selectively in PDA cells 
to regulate a broad autophagy-lysosome program under basal condi- 
tions. 

In non-transformed cells grown in nutrient-replete conditions, the 
MiT/TFE proteins are phosphorylated by mTORCI at the lysosome 
membrane, leading to their interaction with 14-3-3 proteins and 
cytoplasmic retention, whereas mTORC1 inactivation upon star- 
vation enables their nuclear translocation’ ’'!. Correspondingly, 
HPDE and HPNE cells exhibited predominantly cytoplasmic res- 
idence of endogenous and ectopically expressed TFE3, MITF and 
TFEB under full nutrient conditions, and showed nuclear transloca- 
tion after starvation or treatment with the mTOR inhibitor Torin 1 
(Fig. 2a-c and Extended Data Fig. 3a). In stark contrast, a series of 
PDA cell lines showed constitutive nuclear localization of each MiT/ 
TFE protein, regardless of nutrient status or treatment with Torin 1 or 
with inhibitors of MEK, another pathway implicated in their regu- 
lation’ (Fig. 2a-c and Extended Data Fig. 3b-i). Immunoblot for 
phospho-p70S6K, a readout of mTORC1 activity, and immunostain- 
ing for mTOR and LAMP2, indicated that mTORC1 was active and 
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Figure 1 | Coordinate induction of an autophagy-lysosome gene program 
in PDA by MiT/TFE proteins. a, Immunofluorescence staining showing 
extensive overlap of autophagosomes (LC3) and lysosomes (LAMP2) in 8988T 
cells compared with HPDE cells. b, Representative transmission electron 
micrographs showing increased abundance of lysosomes in PDA compared 
with normal pancreas. Relative lysosome numbers per cell are quantified 

(see Methods). N = 473 cells from 4 normal specimens and 406 cells from 3 
PDA specimens.**P < 0.001. N, normal; T, tumour. c, Upregulation of 
autophagy-lysosome genes in PDA relative to matched normal tissue (see 
Supplementary Table 2). d, e, Immunohistochemistry showing upregulation of 
autophagy and lysosomal proteins (d) and nuclear-localized TFE3 (e) in the 
PDA epithelium (filled arrowheads) compared with normal pancreas or 


demonstrated amino-acid-regulated lysosomal association in PDA 
cells (Extended Data Fig. 3j, k). In addition, all cells exhibited co- 
immunoprecipitation of TFE3 and MITF with 14-3-3 and loss of 
binding upon Torin 1 treatment, although the fractional binding in 
PDA cells was lower, consistent with the predominantly nuclear 
residence of the MiT/TFE proteins (Extended Data Fig. 31). Thus, 
PDA cells show constitutive MiT/TFE activation despite displaying 
intact mTORCI signalling. 

To uncover machinery that overrides cytoplasmic retention 
mechanisms, we employed affinity purification and quantitative pro- 
teomics’’ to identify PDA-specific TFE3-interacting proteins. Notably, 
importin 8 (IPO8), a member of the importin-B family of nucleocy- 
toplasmic transporters'*"’, was significantly enriched in TFE3 immu- 
noprecipitates in PDA cells but not in HPDE cells (Extended Data Fig. 
4a). We confirmed that Flag-tagged TFE3 bound endogenous IPO8 
specifically in PDA cells (Fig. 2d and Extended Data Fig. 4b). In addi- 
tion, IPO8 was expressed at elevated levels in human PDA specimens 
and cell lines compared with controls (Fig. 2e and Extended Data 
Fig. 4c, d). 

IPO8 and its most closely related homologue, IPO7, direct nuclear 
import of specific cargo, including growth regulatory proteins’*””. 
Accordingly, IPO8 knockdown caused a marked decrease in nuclear 
TFE3 and reduced overall TFE3 protein levels in multiple PDA cell 
lines (Fig. 2f, left, and Extended Data Fig. 4e). Similar effects 
were observed for both MITF and TFEB upon combined IPO8 and 
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stromal cells (open arrowheads). e, Graph shows quantification of TFE3 
staining intensity (0 = no staining to 3 = high staining) in normal (N = 31) 
and PDA samples (N = 354). f, GSEA showing correlation between MiT/ 
TFE (3TF) expression and the autophagy-lysosome gene signature in 
primary human PDA. FDR, false discovery rate; NES, normalized enrichment 
score; Nom., nominal. g, h, TFE3 knockdown in 8988T cells causes 
coordinated downregulation of autophagy-lysosome genes. g, RNA-seq values 
from N = 3 independent experiments; Error bars represent mean + standard 
deviation (s.d.). P< 0.05 for each gene. h, GSEA. Scale bars: 11 jum (a); 500 nm 
(b); 50 um (d); 20 tum (e). Significance was analysed using two-tailed Student’s 
t-test. 


IPO7 knockdown (Fig. 2f, right, bottom and Extended Data Fig. 4f). 
Importantly, in non-transformed HPDE and HPNE cells, IPO8 
knockdown did not affect basal (cytoplasmic) TFE3 levels or 
Torin-l-induced nuclear translocation of TFE3 (Extended Data 
Fig. 4g, h). Moreover, MiT/TFE mRNA levels were not significantly 
altered by IPO7/IPO8 inactivation (Extended Data Fig. 4i). These 
findings suggest that importins may act selectively in PDA to regu- 
late the stability of MiT/TFE proteins in addition to facilitating their 
nuclear transport. Indeed, cycloheximide treatment studies revealed 
that IPO8 knockdown accelerated TFE3 turnover in 8988T cells 
(Extended Data Fig. 4j). Thus, IPO8-dependent nuclear accumula- 
tion of MiT/TFE factors results in their stabilization and upregula- 
tion of their transcriptional programs in PDA cells regardless of 
nutrient status. 

We found that constitutive activation of MiT/TFE proteins was 
critical for autophagy-lysosome function in PDA cells. Depletion of 
MiT/TFE proteins across a series of PDA cell lines resulted in striking 
defects in lysosome morphology and increased lysosome diameter, 
(986.4 + 30.7 nm versus 2,722 + 72.6 nm for control siRNA (siCTRL) 
and siTFE3, respectively), effects commonly associated with lysosomal 
stress and defective proteolysis” (Fig. 2g and Extended Data Fig. 5a), 
whereas HPDE cells were minimally affected (Extended Data Fig. 5b). 
These responses were phenocopied by treatment of PDA cells 
with the vacuolar-type H* -ATPase inhibitor Bafilomycin Al (BafA1) 
(Fig. 2g). Correspondingly, transmission electron microscopy and 
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Figure 2 | Constitutive nuclear import of MiT/TFE factors controls 
autophagy-lysosome function in PDA. a, b, Fluorescence microscopy 
showing localization of GFP-MITF (a) and GFP-TFE3 (b) in the indicated cells 
under fed and starved (3h Hank’s balanced salt solution (HBSS)) conditions. 
DAPI, 4',6-diamidino-2-phenylindole. c, Immunoblots for TFE3 (left), MITF 
(middle) and TFEB (right) in cytoplasmic (C) and nuclear (N) fractions of cells 
treated with vehicle or Torin 1 (250 nM) for 1h. d, Flag~TFE3 was stably 
expressed in cells and lysates were immunoprecipitated with anti-Flag antibody 
or control IgG and immunoblotted for Flag (TFE3) or IPO8. e, Representative 
IPO8 immunostaining in human specimens. f, PDA cell lines were transfected 
with the indicated siRNAs, fractionated and immunoblotted for TFE3 (8988T 
cells; left), MITF (PL18 cells; right) or TFEB (8902 cells; bottom). g, TFE3 
knockdown causes aberrant lysosomal morphology and increased size as 
shown by LAMP? staining. Inset: magnified view. Graph (right): quantification 
of lysosome diameter in cells expressing shGFP (N = 151), sh TFE3 (N = 193), 


immunofluorescence microscopy of TFE3-knockdown cells revealed 
accumulation of autolysosomes containing undigested cargo (Fig. 2h 
and Extended Data Fig. 5c, d). Moreover, there was a pronounced 
increase in lysosomal pH as demonstrated using the pH-sensitive 
dye Oregon Green 514 coupled to dextran (Fig. 2i). In addition, 
mCherry-GFP-LC3 autophagy reporter assays revealed that TFE3 
inactivation decreased autophagic flux in 8988T cells (55.7 + 9.6% 
mCherry*/GFP~ spots in siCTRL versus 19.8+2.9% in siTFE3; 
Fig. 2j). Finally, to assay proteolytic activity, 8988T cells were fed 
boron-dipyrromethene (BODIPY)-dye-conjugated bovine serum 
albumin (BSA; DQ-BSA), which is taken up by macropinocytosis 
and fluoresces after proteolysis in lysosomes. TFE3 knockdown or 
BafAl treatment resulted in dramatic decreases in fluorescent 
puncta co-localized with LAMP2 (14.88+6.1 spots per cell in 
siCTRL versus 3.7+1.8 and 0.2+0.7 spots per cell in siTFE3 
and BafA1l, respectively; Fig. 2k). Tetramethylrhodamine (TMR)- 
dextran uptake was not affected, indicating specific impairment in 
lysosomal proteolysis rather than in internalization of extracellular 
cargo (data not shown). Notably, lysosomal breakdown of extracel- 
lular albumin scavenged through macropinocytosis is an important 
nutrient source in PDA”. Thus, by governing both autophagic flux 
and lysosomal catabolism, the MiT/TFE proteins support an inte- 
grated cellular clearance program that enables efficient processing 


or treated with BafAl (N = 52). **P < 0.0001; bar, mean. h, Electron 
micrographs showing accumulation of undigested cargo in lysosomes upon 
TFE3 knockdown. i, Measurement of lysosomal pH in 8988T cells transfected 
with the indicated siRNAs. N = 3 independent experiments; **P < 0.001 (see 
Methods). j, Quantification of total number of autolysosomes (mCherry*/ 
GFP " spots) from N = 10 cells per condition using a tandem mCherry-GFP- 
LC3 reporter. Error bars represent mean + standard error of the mean (s.e.m.). 
**P < 0.0001. k, Proteolysis of macropinocytosed protein is impaired by TFE3 
knockdown or BafA1 treatment as determined by pulse-chase with DQ-BSA. 
Degradation of DQ-BSA in lysosomes is quantified (number of fluorescent 
spots per cell co-localizing with LAMP2* lysosomes for N = 3 independent 
experiments with at least 50 cells scored per experiment). **P < 0.0001. Scale 
bars: 18 um (a, b); 50 tum (e); 7.5 jum (g). Significance was analysed using two- 
tailed Student's t-test. 


of cargo from autophagy as well as macropinocytosis, providing 
PDA cells with access to critical sources of both intracellular and 
extracellular nutrients. 

MITF or TFE3 overexpression in pancreatic cells reinforced these 
findings, demonstrating induction of autophagy-lysosome genes, 
LC3B foci and lipidated LC3-II that was further enhanced after treat- 
ment with chloroquine, an inhibitor of lysosome acidification, indi- 
cating marked augmentation of autophagic flux (Extended Data 
Fig. 6a-e). 

Cargo degraded in the lysosome generates metabolic intermediates 
that may feed into multiple pathways”**. To obtain insight into the 
metabolic outputs of MiT/TFE-controlled autophagy-lysosome func- 
tion, we conducted global metabolite profiling of 8988T and PSN1 cells 
transfected with TFE3-targeted siRNAs. These studies revealed stat- 
istically significant changes common to both cell lines in 15.2% (53/ 
347) of detected metabolites. Most prominently altered were amino 
acids and their breakdown products, with 31% (25/80) showing 
decreased abundance, whereas only restricted changes were observed 
in other general metabolite groupings (Fig. 3a and Supplementary 
Table 3). TFE3 silencing did not affect amino acid uptake (Extended 
Data Fig. 7a), suggesting that the autophagy-lysosome system may 
supply a considerable fraction of intracellular amino acids irrespective 
of their external availability in PDA. In support of this model, TFE3 
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Figure 3 | MiT/TFE proteins maintain autolysosome-derived pools of 
amino acids. a, Global metabolite profiling of 8988T and PSN] cells after TFE3 
knockdown reveals a preferential decrease in amino acids compared with other 
general metabolite groups (see Methods). Grey, no change; red, increase; green, 
decrease. b, Quantification of fold change in amino acid levels in 8988T cells 
transfected with siCTRL or siTFE3. *P < 0.03, **P < 0.003. c, BafAl causes a 
decrease in amino acid levels in PDA cells (left), while amino acids are 
unchanged or increased in HPDE cells (right). *P < 0.05, **P < 0.005. 

d, Immunoblot for p-p70S6K levels in the indicated cells at sequential times 


knockdown, BafA1 treatment, and inactivation of autophagy by ATG5 
knockdown significantly decreased intracellular amino acid levels in 
PDA cell lines but not in control cells (Fig. 3b, cand Extended Data Fig. 
7b-d), and led to activation of AMP-activated protein kinase (AMPK) 
and decreased cellular ATP levels, suggesting induction of energy 
stress (Extended Data Fig. 8a-c). Reciprocally, overexpression of 
MITF or TFE3 in HDPE cells caused downregulation of AMPK sig- 
nalling (Extended Data Fig. 8d). 

These findings imply that PDA cells should have increased capacity 
to buffer amino acid levels under conditions of nutrient deprivation. 
To test this, we switched cells to low amino acid concentration media 
(10% of normal) and monitored phosphorylation of p70S6K, a readout 
of mTORCI activity that is sensitive to fluctuations in intracellular 
amino acid levels”. HPDE cells showed extinction of phosphorylated 
(p)-p70S6K within 5-15 min in 10% amino acid media, whereas PDA 
cells (8988T and PSN1) maintained robust levels for >30min 
(Fig. 3d). Correspondingly, TFE3 inactivation in 8988T cells greatly 
accelerated the decline in p-p70S6K (Fig. 3e). Reciprocally, MITF over- 
expression in HPDE cells caused p-p70S6K levels to be sustained upon 
10% amino acid starvation, and markedly increased clonogenic growth 
in this setting while not affecting clonogenicity in full nutrients (Fig. 3f, 
g). Taken together, these data indicate that PDA cells rely on autop- 
hagy-lysosome function for maintenance of intracellular amino acid 
pools, which facilitates survival in response to nutrient stress. 

Consistent with key roles of MiT/TFE proteins in organelle function 
and metabolic regulation, the growth of PDA cell lines expressing high 
endogenous levels of MITF, TFE3 or TFEB was significantly impaired 
by knockdown of the relevant factor, an effect rescued by co-express- 
ion of short hairpin RNA (shRNA)-resistant cDNAs (Fig. 4a, b and 
Extended Data Fig. 9a). By contrast, the growth of HPNE and QGP1 
cells was unaffected (Extended Data Fig. 9a). As previously reported, 
PDA cells were broadly sensitive to chloroquine treatment compared 
with control cells* (Extended Data Fig. 9b). Moreover, ectopic 
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after 10% amino acid (AA) starvation. e, p-p70S6K immunoblots in 8988T cells 
transfected with siCTRL or siTFE3 and grown under control conditions or in 
0% or 10% amino acids for the indicated times. f, g, HPDE-MITF cells show 
sustained p-p70S6K levels under amino acid starvation (f) and enhanced 
colony formation under transient starvation with 10% amino acids (quantified 
in the graph) (g). *P < 0.001. For all graphs, error bars indicate 

mean + standard deviation (s.d.) for N = 3 independent experiments. 
Significance was analysed using two-tailed Student’s t-test. 


expression of MITF or TFE3 rendered HPNE cells hypersensitive to 
chloroquine, linking MiT/TFE-regulated clearance pathways to these 
growth phenotypes (Fig. 4c). 

To extend our findings in primary patient-derived samples, we 
examined a series of early passage PDA cultures. Importantly, 
these cells also showed high basal autophagy, nuclear-localized MiT/ 
TFE proteins, and MiT/TFE-dependent autophagy-lysosome gene 
expression, organelle function, and colony-forming ability (Extended 
Data Fig. 9c-g). 

Notably, knockdown of TFE3 and MITF virtually abolished xeno- 
graft tumour growth of Pancl and 8988T cells, and PL18 cells, respect- 
ively (Fig. 4d and Extended Data Fig. 9h). Conversely, MITF 
overexpression enhanced the tumorigenicity of primary Kras@!7P- 
expressing mouse pancreatic epithelial cells. Whereas Kras*!”? control 
cells formed only focal low-grade pancreatic intraepithelial neoplasia 
(PanIN)-like lesions by 6 weeks after orthotopic injection, co-express- 
ion of MITF resulted in large invasive tumours in 3/5 mice (Fig. 4e and 
Extended Data Fig. 9i). Thus, in keeping with their requirement for 
augmenting autophagy-lysosome function, MiT/TFE proteins are 
potent drivers of PDA pathogenesis in vivo. 

Our work places a new focus on lysosome regulation by MiT/TFE 
proteins as a nexus for metabolic reprogramming in PDA cells. 
Increased lysosome activity integrates major routes for nutrient scav- 
enging—autophagy and macropinocytosis—thereby maintaining 
intracellular amino acid availability (Fig. 4f). Escape of MiT/TFE fac- 
tors from inhibition by mTORCI enables cancer cells to maintain 
robust activation of anabolic pathways while simultaneously benefit- 
ing from the metabolic fine-tuning and adaptation to stress afforded by 
activation of autophagy and lysosomal catabolism. Thus, we propose 
that lysosome activation under the control of the MiT/TFE transcrip- 
tional program is a novel hallmark of PDA. These findings, together 
with recent work”*”’, suggest that targeting aberrant lysosomal func- 
tion has a potential therapeutic benefit in cancer. 
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Figure 4 | MiT/TFE proteins are required for PDA growth. a, Knockdown of 
the indicated MiT/TFE factors impairs in vitro colony-forming ability in a 
panel of PDA cell lines. b, Expression of shRNA-resistant MITF in PL18 cells 
(left) or TFE3 in Pancl cells (right) rescues growth after shRNA-mediated 
knockdown of MITF or TFE3. Error bars indicate mean + s.d. for N = 3 
independent experiments; **P < 0.001. EV, empty vector; M, MITF; shC, 
shCTRL; shM, shMITF; shT3, shTFE3; T3, TFE3. c, Forced expression of MITF 
or TFE3 sensitizes HPNE cells to growth inhibition by chloroquine (CQ). Error 
bars indicate mean + s.d. for N = 3 independent experiments; *P < 0.005, 
**P < 0.0001. d, TFE3 knockdown with two independent shRNAs significantly 
impairs subcutaneous xenograft growth of Pancl cells. Error bars indicate 
mean + s.e.m. for N = 6 tumours per group; *P < 0.03. e, MITF 
overexpression in Kras@'?? mouse PanIN cells promotes tumorigenesis upon 
orthotopic injection. N = 5 mice per group; **P < 0.001. Scale bar, 200 jum. 

f, In PDA, the MiT/TFE proteins are upregulated and exhibit increased 
nuclear residence due to IPO8/IPO7-mediated trafficking, leading to 
transcriptional induction of autophagy-lysosome genes and increased 
biogenesis and function of these organelles. Autophagy-lysosome expansion 
integrates two major pathways for nutrient scavenging and metabolic 
adaptation in PDA, autophagy and digestion of serum proteins obtained by 
macropinocytosis. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Materials. Reagents were obtained from the following sources: antibodies against 
MITF (sc-71587) from Santa Cruz Biotechnology; GAPDH (MAB374) from 
Millipore; TFE3 from Cell Marque (MRQ-37; 354R-14)); LAMP2 (ab-25631) 
from Abcam; TFEB (4240), LC3B (3868), phospho-T389 S6K1 (9234), S6K1 
(2708), phospho-AMPK (Thr172) (2535), AMPK (2603), phospho-ACC 
(Ser 79) (3661), ACC (3676), Lamin (2032), mTOR (2893) and the Flag epitope 
(2368) from Cell Signaling Technology; IPO8 (NBP2-24751) and LC3-II (NB600- 
1384) from Novus Biologicals; CTSS (HPA002988), ATP6V1H (HPA023421) and 
ULK2 (HPA009027) from Sigma-Aldrich; Flag M2 affinity gel, amino acids from 
Sigma Aldrich; RPMI, DMEM, fetal bovine serum (FBS) and dialysed FBS (dFBS), 
Alexa 488- and 568-conjugated secondary antibodies; 70kDa dextran-Oregon 
Green 514, DQ-Green-BSA from Invitrogen; amino-acid-free RMPI from US 
Biologicals; Bafilomycin Al from Tocris. 

Cell culture. Cell lines were obtained from the American Type Culture Collection. 
Patient-derived PDA cultures were generated at Massachusetts General Hospital 
(MGH). Informed consent was obtained from all subjects. STR profiling was 
performed for all cell lines by the MGH Center for Molecular Therapeutics. 
Primary human samples and PDA cell lines were generated from ascites fluid 
under Institutional Review Board (IRB)-approved protocols 02-240 and 
2007P001918 (see also Supplementary Table 4). All lines used had verified activ- 
ating KRAS mutations. Cells were cultured in the following media: 8988T, Pancl, 
PL18, PSN1, 8902, in DMEM supplemented with 10% FBS; HupT3, KLM1 in 
RPMI with 10% FBS; HPDE cells were cultured as described previously”’. Negative 
mycoplasma contamination status of all cell lines and primary cells used in the 
study was established using LookOut Mycoplasma PCR Kit (Sigma, MP0035). 
Constructs. N-terminal Flag-tagged human MITF-H isoform expression con- 
structs were generated by subcloning the cDNA of MITF-H (Origene) into the 
Nhel and EcoRI sites of the pLJM1 (Addgene) lentiviral vector, the NotI and EcoRI 
sites of the pRetroX-Tight-Pur (Clonetech) doxycycline-inducible vector or the 
EcoRI and Sall sites of the pBabe-puro retroviral vector. Dominant-negative 
MITF-H isoform lacking the 5’ transactivation domain was generated as prev- 
iously described for the MITF-M isoform*”. Briefly, the first 280 amino acids 
corresponding to 840 bp was deleted followed by deletion of R301 (corresponding 
to R218 of MITF-M). This cDNA was cloned into the NhelI and EcoRI] sites of 
pLJM1. N-terminal Flag-tagged full-length human TFE3 expression constructs 
were generated by subcloning the cDNA of TFE3 from pcDNA3.1-(HA) -TFE3 
(obtained from D. Fisher) into the BamHI and EcoRI sites of the pBabe-puro 
retroviral vector. GFP-tagged MITF and TFE3 were generated by subcloning the 
cDNA of MITF-H into BglII and Sall sites and TFE3 into HindIII and BamHI sites 
of the pEGFP-C1 (Clonetech) vector. siRNA-resistant expression constructs of 
MITF and TFE3 were generated via QuikChange Site-Directed Mutagenesis 
(Agilent Technologies). mCherry-GFP-LC3 was a gift from A. Kimmelman. 
siRNA and lentiviral-mediated shRNA targets. siRNAs against MITF, TFE3, 
TFEB, ATGS5 and IPO8 were purchased from Ambion and Thermo Scientific. The 
RNAi Consortium clone IDs for the shRNAs used in this study are as follows: 
shMITF, TRCN0000329793 (CGGGAAACTTGATTGATCTTT), TRCNO 
000019122 (CGTGGACTATATCCGAAAGTT); shTFEB, TRCN0000013108 
(CCCACTTTGGTGCTAATAGCT), | TRCN0000013109 (CGATGTCCTT 
GGCTACATCAA). shRNA constructs against human TFE3 were generated by 
annealing oligonucleotides against human TFE3 (corresponding to the sequence 
of the siRNA described later) into the Agel and EcoRI sites of pLKO.1-puro vector. 
TFE3 target sequences are the following: GCAGCTCCGAATTCAGGAACT; 
CGCAGGCGATTCAACATTAAC; GGAATCTGCTTGATGTGTACA. 

Cell proliferation and colony formation assay. Cells were plated in 12-well 
plates at 30,000 cells per well in 2 ml of complete culture media or in 6-well plates 
at 2,000 cells per well in 2 ml of media. At the indicated time points, cells were 
trypsinized and counted. Colony plates were fixed 7 days post-plating in 4% 
paraformaldehyde and stained with 0.1% crystal violet. 

Amino acid starvation and BafA1 treatment. Ten per cent amino acid (AA) 
starvation was conducted by incubating cells in AA-free RMPI supplemented with 
10 mM glucose, and 10% AA calculated relative to levels present in RPMI media. 
Stock solutions of AA were made from individual powders. Where BafA1 treat- 
ment was performed, cells were incubated with 150nM BafAl. Acute AA star- 
vation during colony formation was conducted by incubating cells in AA-free 
media for 18 h on day 2 post-plating. Media was then changed to complete growth 
media for an additional 6 days of cell growth. 

SDS-PAGE analysis. Cells were lysed in ice-cold lysis buffer (150 mM NaCl, 
20mM Tris (pH 7.5), 1mM EDTA, 1mM EGTA, 1% Triton X-100, 2.5mM 
sodium pryophosphate, 1 mM f-glycerophosphate, 1 mM sodium vanadate, and 
one tablet of EDTA-free protease inhibitors (Roche) per 10 ml). Samples were 
clarified by centrifugation and protein content measured using BCA protein 
assay kit (Thermo Scientific). Thirty milligrams of protein was resolved on 9% 


SDS-PAGE gels and transferred onto PVDF membranes (GE Healthcare Life 
Sciences). Membranes were blocked in Tris-buffered saline (TBS) containing 
5% non-fat dry milk and 0.1% Tween 20 (TBS-T), before incubation with primary 
antibody overnight at 4°C. The membranes were then washed with TBS-T fol- 
lowed by exposure to the appropriate horseradish peroxidase (HRP)-conjugated 
secondary antibody for 45 min and visualized on Kodak X-ray film using the 
enhanced chemiluminescence (ECL) detection system (Thermo Scientific). 
Subcellular fractionation was performed using the Thermo Scientific NE-PER 
nuclear and cytoplasmic fractionation kit according to the manufacturer’s proce- 
dures. For immunoprecipitation experiments, 1-3 mg of clarified protein lysate 
from cell lines stably expressing Flag-tagged MiT/TFE proteins was captured on 
anti-Flag antibody-conjugated protein A-agarose beads (M2-Flag affinity gel; 
Sigma-Aldrich). Immune complexes were washed and analysed via SDS-PAGE 
and western blotting. Immunoglobulin G antibody was used as a control. 
Metabolite extraction. Samples were extracted while cells were in the exponential 
growth phase. Wells were washed with 1 PBS and aspirated. Six-hundred micro- 
litres of —80 °C MeOH:H,0 (50:50) was added to each well and the plate was then 
placed at —80 °C for 1-1.5h. As plates were thawing on ice, cells were detached by 
scraping and transferred to a 1.5 ml tube and placed on dry ice. Samples were 
vortexed at 4°C for 30s. To clarify homogenates, samples were centrifuged at 
20,000g, for 7 min at 4°C. Clarified supernatants were transferred to new 1.5 ml 
tubes. Six-hundred microlitres of choloroform (stabilized with amylene) was 
added to each sample, one at a time, and the sample was immediately vortexed 
at room temperature for 30s, and then placed on ice. Phase separation was 
achieved by centrifugation at 20,000g, for 0.25 h at 4 °C. The aqueous (polar) phase 
was extracted and transferred to a fresh 1.5 ml tube. The samples were evaporated 
by SpeedVac, snap frozen in liquid nitrogen, and stored at —80°C for further 
processing. 

Derivatization (all samples). Evaporated samples were warmed to room temper- 
ature by quickly spinning them in a SpeedVac. Samples were dissolved in 30 ul 
of 2% methoxyamine hydrochloride in pyridine (MOX) (Pierce) at 37°C for 
1.5h. Samples were derivatized by adding 45 ul of N-methyl-N-(tert-butyldi- 
methylsilyl)trifluoroacetamide (MBTSTFA) + 1% tert-butyldimethylchlorosilane 
(TBDMCS; Pierce) at 60°C for 1h. 

Gas chromatography-tandem mass spectrometry analysis. Gas chromato- 
graphy—mass spectrometry (GC-MS) analysis was performed as described*'’. 
Briefly, analysis was performed on an Agilent 6890 GC instrument that contained 
a 30m DB-35MS capillary column, which was interfaced to an Agilent 5975B MS. 
Electron impact (EI) ionization was set at 70 eV. Each analysis was operated in 
scanning mode, recording mass-to-charge-ratio spectra in the range of 100-605 
m/z. For each sample, 1 il was injected at 270 °C, using helium as the carrier gas at 
a flow rate of 1 ml min’ !. To mobilize metabolites, the GC oven temperature was 
held at 100°C for 3 min and increased to 300°C at 3.5°C min '. Samples were 
analysed in triplicate. Metabolite levels where normalized to cell number and/or 
protein and plotted as fold change relative to control conditions. 
Immunofluorescence assays. Cells were plated on fibronectin-coated glass cover- 
slips at 100,000-300,000 cells per coverslip. Twelve-to-sixteen hours later, the 
slides were rinsed with PBS once and fixed for 15 min with 4% paraformaldehyde 
at room temperature or for 5 min with —20°C methanol. The slides were rinsed 
twice with PBS and cells were permeabilized with 0.05% Triton X-100 for 2 min. 
After rinsing twice with PBS, the slides were incubated with primary antibody in 
5% normal goat serum for 1h at room temperature, rinsed four times with PBS, 
and incubated with secondary antibodies produced in goat (diluted 1:400 in 5% 
normal goat serum) for 45min at room temperature in the dark. Slides were 
mounted on glass slides using Vectashield (Vector Laboratories) and imaged on 
a spinning disk confocal system (Perkin Elmer) or a Zeiss Laser Scanning 
Microscope (LSM) 710. Images were processed using ImageJ and Adobe 
PhotoshopCS4. For quantification of lysosome diameter, Image] software was 
used to draw lines across 130-290 lysosomes from 3-5 fields containing 4-6 cells 
per field. Line pixel values were converted into 1m based on objective magnifica- 
tion and camera pixel size. Presented data are representative of 3-5 replicate 
experiments. DQ-BSA experiments were performed as previously described’. 
Briefly, cells were incubated with DQ-BSA for 30 min and subsequently chased 
for 1 hin DQ-BSA-free media. Where indicated, treatment with BafA1 was for 1h 
at 100nM. The number of DQ-BSA spots co-localizing with LAMP2-positive 
lysosomes was counted. Measurement of autolysosome maturation was per- 
formed in cells transfected with the mCherry-GFP-LC3 reporter. Cells were 
transfected with siCTRL or siMITF/siTFE3 and 48h later transfected with the 
mCherry-GFP-LC3. After 24h, cells were fixed in 4% PFA and quantification of 
total number of autolysosomes (mCherry*/GFP™ spots) compared to total spots 
per cell (N > 1,000 total spots from N = 10 cells per condition) for at least three 
independent experiments was counted. 
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Histology and immunostaining. Tissue samples were fixed overnight in 4% 
buffered formaldehyde, and then embedded in paraffin and _ sectioned 
(5mm thickness) by the DF/HCC Research Pathology Core. Haematoxylin and 
eosin staining was performed using standard methods. For immunohistochem- 
istry, unstained slides were baked at 55°C overnight, deparaffinized in xylenes 
(two treatments, 6 min each), rehydrated sequentially in ethanol (5 min in 100%, 
3 min in 95%, 3 min in 75%, and 3 min in 40%), and washed for 5 min in 0.3% 
Triton X-100/PBS (PBST) and 3 min in water. For antigen unmasking, specimens 
were cooked in a 2100 Antigen Retriever (Aptum Biologics) in 1X Antigen 
Unmasking Solution, Citric Acid Based (H-3300, Vector Laboratories), rinsed 
three times with PBST, incubated for 10 min with 1% H,O, at room temperature 
to block endogenous peroxidase activity, washed three times with PBST, and 
blocked with 5% goat serum in PBST for 1h. Primary antibodies were diluted 
in blocking solution as follows: anti-TFE3 (Cell Marque, 354R-14) 1:100; anti- 
CTSS (Sigma, HPA002988) 1:200; anti-ATP6V1H (Sigma, HPA023421) 1:100; 
anti-ULK2 (Sigma, HPA009027) 1:100, and incubated with the tissue sections at 
4°C overnight. The evaluation of MITF and TFEB protein levels in patient PDA 
tissue could not be evaluated due to lack of antibodies suitable for immunohisto- 
chemistry (IHC). Commercially available MITF antibodies for IHC are specific to 
the melanoma (MITF-M) isoform or cross-react with other MiT/TFE family 
members. Specimens were then washed three times for 3 min each in PBST and 
incubated with biotinylated secondary antibody (Vector Laboratories) in blocking 
solution for 1h at room temperature. Specimens were then washed three times in 
PBST and treated with ABC reagent (Vectastain ABC kit #PK-6100) for 30 min, 
followed by three washes for 3 min each. Finally, slides were stained for peroxidase 
for 3min with the DAB (diaminebenzidine) substrate kit (SK-4100, Vector 
Laboratories), washed with water and counterstained with haematoxylin. 
Immunostaining was assessed by semi-quantitative analysis of tissue microarray, 
quantifying the intensity of stained tumour cells scored on a scale from 0 (no 
staining) to 3 (strongest intensity). Stained slides were photographed with an 
OlympusDP72 microscope. 

For human samples, PDA samples used for IHC and generation of early passage 

PDA cultures were obtained under IRB-approved protocols 02-240 and 
2007P001918. The samples for IHC were from patients resected at MGH between 
2001 and 2012. Pathological diagnosis of PDA was confirmed by a gastrointestinal 
cancer pathologist at MGH (V.D.). Early passage PDA cultures were derived from 
ascites specimens. Patients were verified to have PDA from biopsies. 
Electron microscopy. For in vitro studies, tissue culture specimens in confluent 
6-well or 10-cm plates were fixed directly with gentle rocking for 15 min at room 
temperature with a glutaraldehyde fixative (2.5% glutaraldehyde, 2% para- 
formaldehyde, 0.025% calcium chloride, in a 0.1M sodium cacodylate buffer, 
pH 7.4), gently scraped and pelleted at 500g, and washed twice with cacodylate 
buffer. To make a cell block, the cells were centrifuged and resuspended in warm 
(60 °C) 2% agar in a warm water bath to keep the agar fluid. The cells were then 
centrifuged again and the agar allowed to harden in an ice water bath. The tissue- 
containing tip of the centrifuge tube was cut off, resulting in an agar block with the 
cells embedded within it. This agar block was subsequent routinely processed in a 
Leica Lynx automatic tissue processor. Briefly, the tissue was post-fixed in osmium 
tetroxide, stained en bloc with uranyl acetate, dehydrated in graded ethanol solu- 
tions, infiltrated with propylene oxide/Epon mixtures, embedded in pure Epon, 
and polymerized overnight at 60 °C. One-micrometre sections were cut, stained 
with toluidine blue, and examined by light microscopy. Representative areas were 
chosen for electron microscopic study and the Epon blocks trimmed accordingly. 
Thin sections were cut with an LKB 8801 ultramicrotome and diamond knife, 
stained with lead citrate, and examined in a FEI Morgagni transmission electron 
microscope. Images were captured with an AMT digital CCD camera. 

For analysis of human tissue, fresh biopsy tissue placed into Karnovsky’s KII 
Solution (2.5% glutaraldehyde, 2.0% paraformaldehyde, 0.025% calcium chloride, 
ina 0.1 M sodium cacodylate buffer, pH 7.4), fixed overnight at 4 °C, and stored in 
cold buffer. Subsequently, they were post-fixed in osmium tetroxide, stained en 
bloc with uranyl acetate, dehydrated in graded ethanol solutions, infiltrated with 
propylene oxide/Epon mixtures, flat embedded in pure Epon, and polymerized 
overnight at 60 °C. One-micrometre sections were cut, stained with toluidine blue, 
and examined by light microscopy. Representative areas were chosen for electron 
microscopic study and the Epon blocks were trimmed accordingly. Thin sections 
were cut with an LKB 8801 ultramicrotome and diamond knife, stained with Sato’s 
lead, and examined in a FEI Morgagni transmission electron microscope. Images 
were captured with an Advanced Microscopy Techniques 2K digital CCD camera. 
Measurement of lysosomal pH. Determination of lysosomal pH was carried out 
on intact cells by ratiometric imaging of the pH-sensitive dye Oregon Green 514, 
as described**”*. 8988T cells were transfected with control siRNA or TFE3 siRNA. 
One day before the experiment, 80,000 cells were plated in black 96-multiwell 
plates in low serum and low glucose media. On the day of the experiment, 
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cells were fed 30pgml* 70kDa dextran conjugated to Oregon Green 514 
(Dx-OG514, Invitrogen) for 6h in full media. Cells were rinsed three times to 
remove excess dye, and chased for 2 h in media free of Dx-OG514, which results in 
the selective accumulation of Dx-OG514 in lysosomes. Dx-OG514 fluorescence 
was collected at 530 nm upon excitation at 440 nm and 490 nm in a Spectramax 
microplate reader (Molecular Devices). The 490/440 fluorescence emission ratios 
were interpolated to a calibration curve. The calibration curve was built by bathing 
cells in a K* isotonic solution (145 mM KCl, 10mM glucose, 1 mM MgCh, and 
20 mM of either HEPES, MES, or acetate) buffered to pH ranging from 3.5 to 7.0 
and containing 10 pg ml’ Nigericin. The 490/440 ratios were plotted as a func- 
tion of pH, and fitted to a Boltzmann sigmoid. All measurements were performed 
in quadruplicate. Presented data are representative of 3 replicate experiments. 
Quantitative RT-PCR. Total cellular RNA was extracted using RNeasy Mini Kit 
(Qiagen) and reverse transcription was performed from 2 1g of total RNA using 
the QuantiTect Reverse Transcription Kit (Qiagen). Quantitative RT-PCR was 
performed with FastStart Universal SYBR Green (Roche) in a MX3005P continu- 
ous fluorescence detector (Stratagene) or using a Lightcycler 480 (Roche). PCR 
reactions were performed in triplicate and the relative amount of cDNA was 
calculated by the comparative CT method using the 18S ribosomal RNA sequences 
as a control. Primer sequences are available upon request. MITF isoform-specific 
PCR was conducted as previously described**. 

ChIP. ChIP experiments were conducted using the Chromatin Immunopre- 
cipitation assay kit (Millipore) according to the manufacturer’s instructions. 
Briefly, 10° cells expressing Flag-tagged MITF or TFE3 were fixed with 1% 
formaldehyde for 10 min (for Flag-MITF experiments, cells were fixed 48 h after 
doxycycline treatment). Cells were washed twice with ice-cold PBS and lysed in 
ice-cold SDS lysis buffer containing protease inhibitors (Roche). Chromatin 
extraction and DNA sonication were conducted according to the manufacturer’s 
protocol. DNA was recovered from immune complexes on M2-Flag affinity gel. 
The immuno-precipitated DNA was recovered and analysed by real-time PCR. 
ChIP primers have been previously described”. 

Xenograft studies. Mice were housed in pathogen-free animal facilities. All 
experiments were conducted under protocol 2005N000148 approved by the 
Subcommittee on Research Animal Care at Massachusetts General Hospital. 
For subcutaneous xenografts, Pancl, 8988T or PL18 cells were infected with 
lentiviral shRNAs targeting TFE3 (n = 2), MITF (n = 2) and GFP (control hair- 
pin, n = 1) and subjected to a puromycin selection (2 1g ml’) in vitro. 1.5-2.5 X 
10° cells, suspended in a 1:1 mixture of HBSS:matrigel, were injected subcuta- 
neously into the lower flank of 3-4 female NOD.CB17-Prkdcscid/] mice per group 
(6-10 weeks of age) (purchase from Jackson Laboratories, strain #001303). 
Tumour length and width were measured weekly and the volume was calculated 
according to the formula (length x width”)/2. All xenograft experiments with 
human PDA lines were approved by the HMS Institutional Animal Care and 
Use Committee (IACUC) under protocol number 04-605. For orthotopic mouse 
studies, MITF or GFP was stably expressed in mouse PanIN primary cells”. 
Alternatively, a doxycycline-inducible expression construct for MITF was also 
stably expressed in mouse PanIN primary cells. 1 X 10° cells were orthotopically 
injected in a 1:1 mixture of matrigel:PBS into the pancreas of SCID mice (n = 5 per 
group). Mice were euthanized 6 weeks after implantation, pancreas were collected 
and submitted for histological examination. No mice were excluded from the 
analysis. Experiments were designed to detect a 50% change in tumour size with 
80% power and a type I error of 5% using the t-test. The experiments were not 
randomized. The investigators were not blinded to allocation during experiments 
and outcome assessment. 

Image analysis and statistics. Image analysis, including densitometry and mea- 
surement of fluorescence intensity, was conducted using Image J software (NIH). 
Statistical analyses of results are expressed as mean + standard deviation, unless 
otherwise specified. Significance was analysed using two-tailed Student’s t-test. A 
P value of less than 0.05 was considered statistically significant. 

Gene expression profiling and GSEA. To build a comprehensive autophagy- 
lysosome gene signature (gene set), we combined published lysosome proteo- 
mics” and autophagy interactome data sets*' together with known lysosomal 
disease associated genes (see Supplementary Table 1). Data sets used for the 
meta-analysis in Fig. 1c, f and Extended Data Fig. 1b, c are accessible from the 
Gene Expression Omnibus (GEO; http://www.ncbi.nlm.nih.gov/gds/), including 
GSE16515, GSE28735 and GSE15471, from The Cancer Genome Atlas (TCGA; 
http://cancergenome.nih.gov/), from the Cancer Cell Line Encyclopedia (CCLE; 
http://www. broadinstitute.org/software/cprg/?q=node/11). For the GEO and 
CCLE data sets, raw expression values in the form of CEL files were collected 
and then processed using RMA in the R bioconductor package. For TCGA data, 
expression data sets were created by combining RNASeqV2 Level3 normalized 
gene result files for individual samples and producing tables with genes in rows 
and samples in columns. Data for the 8988T cells of Fig. 1g, h was processed using 
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a standard RNA-seq pipeline that used Trimmomatic to clip and trim the reads, 
used tophat2 to align the reads to hg19, and used cuffdiff to calculate differential 
expression. GSEA (http://www.broadinstitute.org/gsea/index.jsp) of the express- 
ion data was used to assess enrichment of the autophagy-lysosome gene signature. 
Depending upon the data set, there were several different methods used to rank 
genes for GSEA. In the PDA samples for Fig. 1f and Extended Data Fig. 1i, j genes 
were ranked according to Pearson correlation with a meta-gene formed by the 
mean expression of MITF, TFE3 and TFEB, and P values were obtained by per- 
muting the phenotype (2,500 permutations). In the 8988T cells of Fig. 1g, h, a 
pairwise GSEA was performed by creating ranked lists of genes using the log, ratio 
of siT'FE3 to siCTRL, and P values were obtained by permuting the gene set (1,000 
permutations). In the paired tumour/normal PDA data sets of Extended Data Fig. 
1b, a paired t-test between matched tumour-normal samples was used to rank 
genes and the ranked list was used in GSEA with P values from permuting the gene 
set (2,500 permutations). For Extended Data Fig. 1k, Database for Annotation, 
Visualization and Integrated Discovery (DAVID) analysis of PDA cell lines in 
CCLE was performed as described by using Pearson correlation to select the 100 
genes most correlated with TFE3 and MITF, and checking for enriched pathways. 
Quantitative proteomics on TFE3-associated proteins. Enriched proteins were 
subjected to multiplexed quantitative proteomics analysis using tandem-mass tag 
(TMT) reagents on an Orbitrap Fusion mass spectrometer (Thermo Scientific). 
Disulfide bonds were reduced with ditheiothreitol (DTT) and free thiols alkylated 
with iodoacetamide as described previously’. Proteins were then precipitated with 
tricholoracetic acid, resuspended in 50 mM HEPES (pH 8.5) and 1M urea and 
digested first with endoproteinase Lys-C (Wako) for 17 h at room temperature and 
then with sequencing-grade trypsin (Promega) for 6h at 37°C. Peptides were 
desalted over Sep-Pak C,g solid-phase extraction (SPE) cartridges (Waters), the 
peptide concentration was determined using a BCA assay (Thermo Scientific) and 
a maximum of 50 yg of peptides were labelled with one out of the available TMT- 
10plex reagents (Thermo Scientific)”. To achieve this, peptides were dried and 
resuspended in 50 il of 200 mM HEPES (pH 8.5) and 30% acetonitrile (ACN) and 
10 ug of the TMT in reagent in 5 pl of anhydrous ACN was added to the solution, 
which was incubated at room temperature for 1 h. The reaction was then quenched 
by adding 6 pl of 5% (w/v) hydroxylamine in 200mM HEPES (pH 8.5) and 
incubation for 15 min at room temperature. The solutions were acidified by adding 
50 pl of 1% trifluoroacetic acid (TFA) and samples that were quantified in one 
mass spectrometry measurement were pooled before being subjected to desalting 
over a Sep-Pak C,s (SPE) cartridge. The peptide mixtures were then dried and 
resuspended in 8 11, three of which were analysed by microcapillary liquid chro- 
matography tandem mass spectrometry on an Orbitrap Fusion mass spectrometer 
and using a recently introduced multistage (MS3) method to provide highly 
accurate quantification». 

The mass spectrometer was equipped with an EASY-nLC 1000 integrated auto- 
sampler and HPLC pump system. Peptides were separated over a 100 1m inner 
diameter microcapillary column in-house packed with first 0.5 cm of Magic C4 
resin (5 um, 100 A, Michrom Bioresources), then with 0.5 cm of Maccel Cg resin 
(3 um, 200 A, Nest Group) and 29cm of GP-C18 resin (1.8 tm, 120 A, Sepax 
Technologies). Peptides were eluted applying a gradient of 8-27% ACN in 
0.125% formic acid over 165 min at a flow rate of 300 nl min’. To identify and 
quantify the TMT-labelled peptides we applied a synchronous precursor selection 
MS3 method*** in a data-dependent mode. The scan sequence was started with 
the acquisition of a full MS or MS1 one spectrum acquired in the Orbitrap 
(m/z range, 500-1,200; resolution, 60,000; AGC target, 5 10°; maximum injec- 
tion time, 100 ms), and the ten most intense peptide ions from detected in the full 
MS spectrum were then subjected to MS2 and MS3 analysis, while the acquisition 
time was optimized in an automated fashion (top speed, 5 s). MS2 scans were done 
in the linear ion trap using the following settings: quadrupole isolation at an 
isolation width of 0.5 Th; fragmentation method, CID; AGC target, 1 x 10*; max- 
imum injection time, 35 ms; normalized collision energy, 30%). Using synchron- 
ous precursor selection the ten most abundant fragment ions were selected for the 
MS3 experiment following each MS2 scan. The fragment ions were further frag- 
mented using the HCD fragmentation (normalized collision energy, 50%) and the 
MS3 spectrum was acquired in the Orbitrap (resolution, 60,000; AGC target, 
5 X 10+; maximum injection time, 250 ms). 

Data analysis was performed on an on an in-house generated SEQUEST- 
based” software platform. RAW files were converted into the mzXML format 
using a modified version of ReAdW.exe. MS2 spectra were searched against a 
protein sequence database containing all protein sequences in the human 
UniProt database (downloaded 4 February 2014) as well as that of known 
contaminants such as porcine trypsin. This target component of the database 
was followed by a decoy component containing the same protein sequences but 


in flipped (or reversed) order**. MS2 spectra were matched against peptide 
sequences with both termini consistent with trypsin specificity and allowing two 
missed trypsin cleavages. The precursor ion m/z tolerance was set to 50 p.p.m., 
TMT tags on the N terminus and on lysine residues (229.162932 Da) as well as 
carbamidomethylation (57.021464 Da) on cysteine residues were set as static 
modification, and oxidation (15.994915 Da) of methionines as variable modifica- 
tion. Using the target-decoy database search strategy** a spectra assignment false 
discovery rate of less than 1% was achieved through using linear discriminant 
analysis with a single discriminant score calculated from the following SEQUEST 
search score and peptide sequence properties: mass deviation, XCorr, dCn, num- 
ber of missed trypsin cleavages, and peptide length. The probability of a peptide 
assignment to be correct was calculated using a posterior error histogram and the 
probabilities for all peptides assigned to a protein were combined to filter the data 
set for a protein FDR of less than 1%. Peptides with sequences that were contained 
in more than one protein sequence from the UniProt database were assigned to the 
protein with most matching peptides”. 

TMT reporter ion intensities were extracted as that of the most intense ion 
within a 0.03 Th window around the predicted reporter ion intensities in the 
collected MS3 spectra. Only MS3 with an average signal-to-noise value of larger 
than 28 per reporter ion as well as with an isolation specificity“ of larger than 0.75 
were considered for quantification. Reporter ions from all peptides assigned to a 
protein were summed to define the protein intensity. A two-step normalization of 
the protein TMT intensities was performed by first normalizing the protein intens- 
ities over all acquired TMT channels for each protein based to the median average 
protein intensity calculated for all proteins. To correct for slight mixing errors of 
the peptide mixture from each sample a median of the normalized intensities was 
calculated from all protein intensities in each TMT channel and the protein 
intensities were normalized to the median value of these median intensities. 


29. Tzatsos, A. et al. KDM2B promotes pancreatic cancer via Polycomb-dependent 
and -independent transcriptional programs. J. Clin. Invest 123, 727-739 (2013). 

30. Garraway, L. A. et al. Integrative genomic analyses identify MITF as a lineage 
survival oncogene amplified in malignant melanoma. Nature 436, 117-122 
(2005). 

31. Gameiro, P. A. et al. In vivo HIF-mediated reductive carboxylation is regulated by 
citrate levels and sensitizes VHL-deficient cells to glutamine deprivation. Cell 
Metab. 17, 372-385 (2013). 

32. Metallo,C.M. etal. Reductive glutamine metabolism by IDH1 mediates lipogenesis 
under hypoxia. Nature 481, 380-384 (2012). 

33. Commisso, C. et a/. Macropinocytosis of protein is an amino acid supply route in 
Ras-transformed cells. Nature 497, 633-637 (2013). 

34. Steinberg, B. E. et al. A cation counterflux supports lysosomal acidification. J. Cell 
Biol, 189, 1171-1186 (2010). 

35. Zoncu, R. et al. mMTORC1 senses lysosomal amino acids through an inside-out 
mechanism that requires the vacuolar H* -ATPase. Science 334, 678-683 (2011). 

36. Hershey, C. L. & Fisher, D. E. Genomic analysis of the Microphthalmia locus and 
identification of the MITF-J/Mitf-J isoform. Gene 347, 73-82 (2005). 

37. Settembre, C. et a/. TFEB links autophagy to lysosomal biogenesis. Science 332, 
1429-1433 (2011). 

38. Corcoran, R. B. et al. STAT3 plays a critical role in KRAS-induced pancreatic 
tumorigenesis. Cancer Res. 71, 5020-5029 (2011). 

39. Chapel, A. et al. An extended proteome map of the lysosomal membrane reveals 
novel potential transporters. Mol. Cell. Proteomics 12, 1572-1588 (2013). 

40. Schréder, B. etal. Integral and associated lysosomal membrane proteins. Traffic 8, 
1676-1686 (2007). 

41. Behrends, C., Sowa, M. E., Gygi, S. P. & Harper, J. W. Network organization of the 

human autophagy system. Nature 466, 68-76 (2010). 

42. Ballabio, A. & Gieselmann, V. Lysosomal disorders: from storage to cellular 

damage. Biochim. Biophys. Acta 1793, 684-696 (2009). 

43. Huang, D.W., Sherman, B.T. & Lempicki, R. A. Systematic and integrative analysis of 

arge gene lists using DAVID Bioinformatics Resources. Nature Protoc. 4, 44-57 

(2009). 

44. Ting, L, Rad, R., Gygi, S. P. & Haas, W. MS3 eliminates ratio distortion in isobaric 

multiplexed quantitative proteomics. Nature Methods 8, 937-940 (2011). 

45. McAlister, G. C. et al. Increasing the multiplexing capacity of TMTs using reporter 

ion isotopologues with isobaric masses. Anal. Chem. 84, 7469-7478 (2012). 

46. Weekes, M. P. et a/. Quantitative temporal viromics: an approach to investigate 

host-pathogen interaction. Ce// 157, 1460-1472 (2014). 

47. Eng, J. K., McCormack, A. L. & Yates, J. R. An approach to correlate tandem mass 
spectral data of peptides with amino acid sequences in a protein database. J. Am. 
Soc. Mass Spectrom. 5, 976-989 (1994). 

48. Elias, J. E. & Gygi, S. P. Target-decoy search strategy for increased confidence in 
large-scale protein identifications by mass spectrometry. Nature Methods 4, 
207-214 (2007). 

49. Huttlin, E. L. etal. A tissue-specific atlas of mouse protein phosphorylation and 
expression. Ce// 143, 1174-1189 (2010). 

50. Jones, S. et al. Core signaling pathways in human pancreatic cancers revealed by 
global genomic analyses. Science 321, 1801-1806 (2008). 


©2015 Macmillan Publishers Limited. All rights reserved 


a 


LETTER 


** 


PSN1 


Relative LC3 fluorescence/area 
wo 


24 
a | 
N 17 
ro) 
ro) o 
* Peeees 
ann 2 
c 7000-5 e 
6000-L | © TFES 
GSE15471 GSE16515 GSE28735 e MITE 
AUTOPHAGY-LYSOSOME GENES AUTOPHAGY-LYSOSOME GENES AUTOPHAGY-LYSOSOME GENES 33007 | O TFEB 
aw NES = 2.33 ae NES = 1.6 p04 NES = 2.09 = 2800 | @ 3TF (mean) - 
Woe FDR q-val < 0.0001 wy, 0.8 FDR q-val < 0.003 ree EDR ¢:val's'0:0001 << 2000-7 e 
g : nom p-val < 0.0001 g nom p-val < 0.01 oo. nom p-val < 0.0001 a 4 7 
8 
8 02 5°? 202 1500+ 0 © ® © OP eG 
5 5 e oleae 
E 0.1 e 0.1 Eo 1000--e------------_ ee 
3 3 2 4 eo 9 ° 
1G 0.0: = 0.0: iG 0.0 5000 ~- -e -o- ° ° 
5000 10000 15000 20000 ul 5000 10000 15000 1000 5000 10000 15000 }00 ] 9 @ SC O @ ° 
j e ee 
HE | (ER | oh —— FP __,—7—— 
PDA Normal PDA Normal PDA Normal 50% 5 008008 
sos zpaesodk gs 
Ge xpe@tgegokts 
9 —-m SS DD EF HRB E 
S oa ps 6 @ 2 vg 
> 316535°%° gs 3S 
a YY 
ASPS 
z & e f 
= irs 
QF i 
a a Oo 
TFE3 @@ s) 50 = 
2 = Zz 
GAPDH rm © 3 40 A 
s9s8sT BS < o 
fe) c 30 > 
you a* xe) 2 
fe Se 8 < 
92 & 8 20 Zz 
ao @ a a 
Mite rf a 
i 10 
GAPDH @peme {7 
PL18 rat 0 ae er 
(anti-MITF ab) © ££ 3 
2 3 2 


@ 
= 
5 
6 
< 


cell line 
tumour 


M14 melanoma 


TCGA (Primary PDA) 


CFPAC-1 


non-PDA Patient derived PDA cell lines 


-50 kDa 


MITF « 
EO Cli 
ee ee es Exons 5-9 TFES omer = = - iho i : 5000 10000 15000 20000 
——S SSS is GAPDH S————eresesenemmeees@ °7 (°° Lee ULL I 


i wows see ~ NES = 1.65 
See @ <5 255582283 8 Bos ronal 
: Troanaoaaaseas ® 03 nom p-val < 0.,042 
| area — Gees — — Ta —_— “seen ; 
e 7 -75 kDa 2 “3 
ie 
e 
Ww 


3TF high 3TF low 
CCLE (PDA cell lines) CCLE: TFE3 S 8 CCLE: MITF 5 3 
MiT/TFE: AUTOPHAGY-LYSOSOME GENES a Me qT 7 
1 1 i) i) 1 ire 
P 0.5 NES = 1.63 ' ; , 
w FDR q-val = 0.134 * ap Re Axon guidance (14) 
8 0.4: nom p-val < 0.036 Ubiquitin proteolysis (22) : i Other glycan degradation (5) 
2 0.3 Spliceosome (19) : : Laeosarietia) 
o l 
£02 RNA polymerase (7) \ Tight junction (13) : i 
= ot Oocyte meiosis (15) | | Fructose/mannose metabolism (5) ; ; 
& pee 
Insulin signaling pathway (17) ECM receptor interaction (9) : i i 
0 5000 wie An Pyrimidine metabolism (13) | ! Histidine metabolism (9) I 1 \ 
HM | 
3TF high 3TF low ° : eS 


-log10 (p value) 


-log10 (p value) 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


Extended Data Figure 1 | Tumour-specific expression and constitutive 
activation of MiT/TFE proteins in PDA. a, Immunofluorescence staining of 
autophagosomes (LC3) and lysosomes (LAMP2) showing extensive overlap of 
these organelles and increased LC3 immunofluorescence in PDA cell lines 
(graph, right; error bars indicate mean + s.d. for N = 3 independent 
experiments with at least 130 cells scored. **P < 0.001. Significance was 
analysed using two-tailed Student’s t-test). Scale bar, 11 um. b, GSEA of 
different human PDA data sets for enrichment of the autophagy-lysosome 
gene signature in tumour versus normal tissue (Gene Expression Omnibus 
(GEO) accession numbers indicated). c, Mean expression (reads per kilobase 
per million (RPKM)) of TFE3, MITF and TFEB individually or cumulatively as 
a meta-gene formed by the mean of the three transcription factors (3TF) in 
primary tumour specimens from the indicated malignancies (The Cancer 
Genome Atlas (TCGA) data set). d, TFE3 and MITF antibody validation by 
western blotting in cells treated with the indicated siRNA. TFE3 antibody 
immunohistochemistry (IHC) validation using alveolar soft part sarcoma 
(ASPS) tissue. This antibody (MRQ-37, Cell Marque) is used as a clinical 
diagnostic for ASPS. Scale bars: 100 tum (top); 20 um (bottom). e, Gene 
expression analysis showing upregulation of MiT/TFE genes in subsets of PDA 
relative to normal pancreatic ductal tissue. SAGE data” from normal 


microdissected pancreatic ductal cells (normal microdissected control, black 
box; HPDE, grey box), cultured PDA cells (cell line), and PDA xenografts and 
primary tumour tissues (tumour). A.U., arbitrary units. f, Quantitative reverse 
transcription-polymerase chain reaction (qRT-PCR) analysis of MiT/TFE 
expression levels in a panel of human PDA cell lines. Note that PL18 and 
HupT3 preferentially express high levels of MITF, while 8988T, PSN1 and 
Pancl express TFE3 at higher levels. The highest levels of TFEB were detected 
in 8902 and A13A cells. g, RT-PCR analysis reveals that PDA cells express 
distinct MITF isoforms. Note the complete absence of all MITF isoforms in 
normal HPDE cells. PDA cells lack the melanoma-specific M isoform (detected 
in M14 melanoma cells, lane 2). h, MITF, TFE3 and TFEB protein levels in a 
panel of non-PDA cell lines, patient-derived PDA cultures and PDA cell lines. 
i, GSEA analysis showing correlation between expression of MiT/TFE factors 
and autophagy-lysosome gene set in primary human PDA specimens (TCGA 
data set). j, GSEA analysis showing correlation between cumulative expression 
of MiT/TFE factors (see Methods) and the autophagy-lysosome gene set in 
human PDA cell lines (Cancer Cell Line Encyclopedia (CCLE) data set). 

k, Database for Annotation, Visualization and Integrated Discovery (DAVID) 
analysis of gene sets correlating with increasing expression of TFE3 (left) or 
MITF (right) in human PDA cell lines (CCLE data set). 
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Extended Data Figure 2 | MiT/TFE-dependent regulation of autophagy- 
lysosome gene expression in PDA cell lines. a, ChIP analysis of Flag-MITF 
(left) and Flag-TFE3 (right) binding to autophagy-lysosome genes in 8902 
and Pancl cells, respectively. Histograms show the amount of immunopre- 
cipitated DNA detected by qPCR normalized to input and plotted as relative 
enrichment over mock control. Error bars indicate mean + s.e.m. for N = 3 
independent experiments. *P < 0.05. EV, empty vector. b, siRNA-mediated 
knockdown of MITF in HupT3 and PL18 cells causes a decrease in autophagy- 
lysosome gene expression, assayed 48 h after siRNA transfection. *P < 0.05. 
c, Knockdown of TFE3 in PSN1, Pancl and 8988T cells, or of TFEB in 8902 
cells, causes a decrease in autophagy-lysosome gene expression. *P < 0.05. 

d, HPDE, HPNE and QGP!1 control cells show minimal changes in autophagy- 
lysosome gene expression upon knockdown of the MiT/TFE genes. 


e, Decreased autophagy-lysosome gene expression after MITF knockdown 
(left; PL18 cells; *P < 0.05) or TFE3 knockdown (right; 8988T cells; *P < 0.01) 
is rescued by transient ectopic expression of MITF or TFE3. Cells were 
transfected with expression constructs for MITF or TFE3 24h post-siRNA 
transfection. After 48 h, gene expression was assayed. f, Expression of 
dominant-negative MITF (MITF-DN) in HupT3 cells causes a decrease in 
autophagy-lysosome gene expression compared with control cells (left; 

*P < 0.02). Similar results are seen in PSN] cells expressing doxycycline (Dox)- 
inducible MITF-DN upon addition of 1 1g ml’ of Dox for 48 h (right; 

*P < 0.05). For all graphs error bars indicate mean + s.d. for N= 3 
independent experiments. Significance was analysed using two-tailed Student’s 
t-test. 
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Extended Data Figure 3 | MiT/TFE transcription factors escape mTOR- 
mediated cytoplasmic retention in PDA. a, Subcellular localization of 
ectopically expressed TFEB-GFP in HPDE cells under full nutrient and 
starvation conditions (3 h HBSS) (left) and Torin-1-dependent nuclear 
localization of ectopically expressed Flag—MITF (left) or Flag—TFE3 (right) in 
HPNE cells (right). b, Subcellular localization of ectopically expressed GFP- 
MITF (left) or GEP-TFE3 (right) in HupT3 and 8988T cells, respectively, under 
full nutrient and starvation conditions (3h HBSS). ¢, d, Subcellular 
fractionation studies showing that endogenous TFE3 (c) and MITF (d) are 
constitutively nuclear localized in PDA cell lines. e, Immunofluorescence 
staining of endogenous TFEB in A13A and 8902 PDA cells. Note the 
predominant nuclear localization under both full nutrient (fed) and starved 
conditions. f, Subcellular fractionation of PL18 and HupT3 PDA cells under full 
nutrient, amino acid (AA) starved and amino acid re-fed conditions shows 
constitutive nuclear residence of endogenous MITF regardless of the nutrient 
status of the cells. Lamin = nuclear fraction; GAPDH = cytoplasmic fraction. 
g, Subcellular fractionation of Pancl cells anda primary patient-derived culture 


(PDAC1) showing constitutive nuclear localization of TFE3 (in Pancl) and 
MITF (in PDAC1) independent of Torin 1 treatment. h, Immunoblot for 
p-ERK1/2 in the indicated cell lines treated with vehicle or with the MEK 
inhibitor, AZD6244 (AZD). i, Neither AZD6244 nor Torin 1 affect MITF 
localization (PL18 cells) or TFE3 localization (8988T cells). j, Immunoblot 
showing readily detectable p-p70S6K in the indicated non-PDA and PDA cell 
lines, and extinction of phosphorylation upon Torin 1 treatment. 

k, Immunofluorescence showing that amino acid re-feeding of starved 8988T 
and Pancl PDA cells results in mTOR (green) translocation from a diffuse 
cytoplasmic distribution to the lysosome (LAMP2; red), as indicated by co- 
localization with LAMP2. Scale bar, 20 jum. 1, The indicated non-PDA (top) and 
PDA cell lines (bottom) stably expressing Flag-tagged TFE3 (F-TFE3) or MITF 
(F-MITF) were treated with vehicle or Torin 1. Cells were then lysed, subjected 
to Flag immunoprecipitation and immunoblotted for Flag and 14-3-3. Note 
that in all cell lines, 14-3-3 is detected in the anti-Flag immunoprecipitates and 
binding is lost upon Torin 1 treatment. All data shown are representative of at 
least N = 3 independent experiments. 
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Extended Data Figure 4 | IPO8 drives increased MiT/TFE nuclear import in 
PDA cells. a, Identification of IPO8 as a PDA-specific binding partner of TFE3. 
HPDE (left), PSN1 (middle) and 8988T cells (right) stably expressing Flag- 
TFE3 or control vector were subjected to affinity purification, followed by 
multiplexed quantitative proteomics analysis using tandem-mass tag (TMT) 
reagents. The graphs show normalized protein intensities in the Flag—TFE3 and 
control samples. Note the specific enrichment of IPO8 in the Flag-TFE3- 
expressing PDA cell lines. In HPDE cells, IPO8 did not score as a significantly 
enriched interactor (mean log, ratio Flag-TFE3/control = —0.14 (N = 3), 
P= 0.30 (paired two-tailed t-test), while in PSN1 and 8988T IPO8 was 
significantly enriched in TFE3 immunoprecipitates (mean log, ratio Flag- 
TFE3/control = 4.35 (N = 3), P= 0.015 (PSN1) and mean log; ratio Flag- 
TFE3/control = 1.70 (N = 3), P = 0.002 (8988T)). b, Immunoprecipitation of 
endogenous IPO8 with Flag-TFE3 in a PDA cell line (PSN1) and primary PDA 
culture (PDAC3). c, (RT-PCR showing increased expression of IPO8 in PDAC 
cell lines and primary patient-derived cultures (red bars) compared to control 
pancreatic ductal cells (HPDE and HPNE, black bars). d, Quantification of 
IPO8 immunohistochemistry staining intensity (0 = no staining to 3 = high 


staining) in normal (N = 11) and PDA (N = 110) patient samples. 

e, Subcellular fractionation and immunoblot analyses of the indicated cell lines 
transfected with control siRNA (siCTRL) or siIPO8. Note that siIPO8 leads toa 
marked decrease in nuclear TFE3 in PDA cells (PSN1 and Pancl1) (left) and in 
whole cell lysates (right). f, Immunoblot of whole cell lysates showing that 
IPO8+IPO7 knockdown decreases the levels of MITF and TFEB in PL18 and 
8902 cells, respectively. g, Knockdown of IPO8 has no effect on total TFE3 
protein (left) or mRNA (right) levels in HPDE and HPNE cells. Error bars 
indicate mean = s.d. for N = 3 independent experiments. h, Torin-1-induced 
TFE3 nuclear localization in HPNE cells is unaffected by knockdown of IPO8. 
i, RT-PCR showing that siRNA-mediated knockdown of IPO8, or of both 
IPO8 and IPO7, effectively reduces target expression without significantly 
affecting the expression of TFE3, TFEB or MITF mRNA levels. Error bars 
indicate mean = s.d. for N = 3 independent experiments. j, Immunoblot of 
8988T cells transfected with control siRNA (siCTRL) or sil[PO8 and treated 
with cycloheximide (CHX) for the indicated time points shows a decrease in 
steady-state levels and stability of TFE3 upon loss of IPO8. Data are 
representative of N = 3 independent experiments. 
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Extended Data Figure 5 | Altered lysosome morphology and function 
following loss of MiT/TFE proteins. a, RNAi-mediated knockdown of MITF 
in PL18 (right; N = 259 siCTRL, N = 263 siMITF) or TFEB in PaTu8902 cells 
(left; N = 273 siCTRL, N = 147 siT FEB) causes aberrant lysosome morphology 
and an increase in lysosome diameter as visualized by immunofluorescence 
staining for LAMP2. **P < 0.001; bar, mean. b, Lysosome size in HPDE cells is 
not affected by knockdown of MITE (N = 156) and TFE3 (N = 81), and is only 
slightly increased by TFEB knockdown (N = 198) relative to siCTRL 

(N = 296). NS, not significant; *P < 0.05; bar, mean. Scale bar, 7.5 jim. 

c, Electron microscopy of 8988T PDA cells transfected with siCTRL (panels 
1-5) or siTFE3 (panels 6-10). Note that TFE3 loss causes an accumulation of 
undigested material shown by asterisks indicating a defect in clearance (graph 
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indicates percentage lysosomes filled with cargo; N = 80 lysosomes for siCTRL 
and N = 153 lysosomes for siTFE3) and an increase in average lysosome 
diameter (quantified in graph on the right; N = 63 lysosomes in siCTRL and 
N= 68 lysosomes in siT'FE3). Scale bar, 1 jim. **P < 0.001. 

d, Immunofluorescence staining with LC3 (green) and LAMP2 (red) in 8988T 
cells after siRNA-mediated knockdown of TFE3, shows similar accumulation 
of undigested LC3-positive aggregates encapsulated within enlarged LAMP2 
positive lysosomes (bottom) compared with control cells (top). Magnifications 
of the boxed regions are shown (right). Scale bar, 7.5 um. Error bars represent 
mean + s.d. Data are representative of at least N = 3 independent experiments. 
Significance was analysed using two-tailed Student's f-test. 
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Extended Data Figure 6 | Ectopic MiT/TFE expression in PDA cell lines 
causes an increase in autophagy-lysosome function. a, b, Ectopic expression 
of MITF induces autophagy-lysosome genes in HPDE cells (a), and causes an 
increased abundance of LC3 puncta (b) as measured by immunoflourescence 
staining of endogenous LC3 (red) and quantified in the graph on the right. 
*P<0.01, **P < 0.001. Scale bar, 7.5 um. c, Ectopic expression of Flag-tagged 
MITF or TFE3 in HPDE cells (left) or HPNE cells (right) causes an increase in 
autophagic flux as measured by the increase in LC3-II versus LC3-I, following 
treatment with 25 1M chloroquine (CQ) for 18h. MITF and TFE3 protein 
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expression is represented by immunoblot for Flag. d, Expression of MITF in 
8902 PDA cells (which lack MITF expression) or in MiaPaca cells (which 
express low levels of all MiT/TFE members) causes an increase in autophagy- 
lysosome gene expression. *P < 0.05. e, Dox-inducible expression of MITF in 
MiaPaca cells causes an increase in endogenous LC3-positive puncta, as 
measured in the graph on the right, indicating increased autophagy induction. 
N = 68 cells, —Dox; N = 100 cells, +Dox; **P < 0.001. Error bars represent 
mean = s.d. Data are representative of at least N = 3 independent experiments. 
Significance was analysed using two-tailed Student’s t-test. Scale bar, 15 um. 
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Extended Data Figure 7 | Role of lysosome in maintaining amino acid levels — b-d, Effect of BafA1 (b), si TFE3 (c), and siATGS (d) on intracellular amino acid 
in PDA. a, Amino acid uptake was measured as fold change in extracellular _ levels in the indicate cell lines. Error bars represent mean + s.d. for N = 3 
amino acids in 8988T cells after transfection with two siRNAs against TFE3 independent experiments. *P < 0.05. Significance was analysed using two- 
relative to siCTRL. Media was changed 1h before media samples were tailed Student’s t-test. 

harvested for analysis. Data are matched to results presented in Fig. 3b. 
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Extended Data Figure 8 | MiT/TFE proteins couple amino acid metabolism and p-AMPK in PDA cells but not in HPDE, HPNE or QGP1 cells. d, Forced 
to energy homeostasis in PDA. a, Knockdown of TFE3 in PSN1 and 8988T expression of MITF or TFE3 in HPDE cells causes a decrease in p-ACC and 


cells causes an increase in p-ACC (Ser 79) and p-AMPK (Thr 172) levels. 
b, Knockdown of TFE3 (in 8988T and PSNI cells) or MITF (in HupT3 cells) 
causes a decrease in cellular ATP levels. N = 3 independent experiments; 


p-AMPK levels. Error bars represent mean + s.d. Data are representative of at 
least 3 independent experiments. Significance was analysed using two-tailed 
Student’s t-test. 


*P < 0.05, **P < 0.001. c, BafA1 treatment (150 nM) for 18h induces p-ACC 
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Extended Data Figure 9 | Regulation of in vitro and in vivo growth by MiT/ 
TFE proteins. a, A panel of human PDA cell lines were infected with shRNAs 
targeting MITF, TFE3 or TFEB. Growth relative to cells infected with shGFP 
control was assayed 8-10 days after infection. Note that sensitivity to individual 
knockdown correlates with the relative expression of each protein (see 
Extended Data Fig. 1f). Coloured bars indicate knockdown condition, which 
leads to the greatest growth impairment. b, Selective sensitivity of PDA cells 
compared with non-PDA cells (QGP1, HPNE and the non-small-cell lung 
cancer cell line H460) to treatment with 50 uM chloroquine (CQ) for 4 days. 
Error bars indicate mean + s.e.m. *P < 0.05. c, Immunoblot showing robust 
detection of LC3-II across a panel of primary patient-derived cultures (PDAC 
1-6) and PDA cell lines. d, Subcellular fractionation showing that PDA patient 
cultures have constitutively nuclear MITF and TFE3. e, RT-PCR showing that 
MITF (left) or TFE3 (right) knockdown suppressed multiple autophagy- 
lysosome genes in patient-derived PDA cells. *P < 0.01. 

f, Immunofluorescence staining for LAMP2 showing that MITF (N = 212 


siCTRL, N = 220 siMITF), TFE3 (N = 228 siCTRL, N = 244 siT FE3) and 
TFEB (N = 229 siCTRL, N = 271 siTFEB) knockdown results in enlarged, 
dysmorphic lysosomes in patient-derived PDA cultures and quantified in the 
graph on the right. Scale bar, 7.5 jum. **P < 0.0001. g, Knockdown of the 
indicated MiT/TFE factors inhibits colony formation in a series of primary 
PDA cultures. h, 8988T cells infected in vitro with shGFP or shTFE3 (left) and 
PL18 cells infected with shGFP or two hairpins targeting MITF (shMITF_1 and 
shMITF_2; right) were implanted subcutaneously on both flanks of SCID 
mice (N = 4 mice per group). Tumour xenograft growth was monitored over 
the course of 70 (8988T) and 50 (PL18) days. Error bars indicate mean + s.e.m. 
i, Forced expression of MITF in Kras@? mouse PanIN cells causes an 
increase in autophagy-lysosome gene expression relative to control 

cells in vitro. *P < 0.005. Data are representative of N = 3 

independent experiments. Significance was analysed using two-tailed 
Student’s t-test. 
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Energetic coupling between plastids and 
mitochondria drives CO> assimilation in diatoms 


Benjamin Bailleul***, Nicolas Berne’, Omer Murik*, Dimitris Petroutsos’, Judit Prihoda*, Atsuko Tanaka‘, Valeria Villanova®, 
Richard Bligny”, Serena Flori®, Denis Falconet”, Anja Krieger-Liszkay’, Stefano Santabarbara®, Fabrice Rappaport?, Pierre Joliot?, 
Leila Tirichine*, Paul G. Falkowski’, Pierre Cardol!, Chris Bowler* & Giovanni Finazzi° 


Diatoms are one of the most ecologically successful classes of photo- 
synthetic marine eukaryotes in the contemporary oceans. Over 
the past 30 million years, they have helped to moderate Earth’s 
climate by absorbing carbon dioxide from the atmosphere, seques- 
tering it via the biological carbon pump and ultimately burying 
organic carbon in the lithosphere’. The proportion of planetary 
primary production by diatoms in the modern oceans is roughly 
equivalent to that of terrestrial rainforests’. In photosynthesis, the 
efficient conversion of carbon dioxide into organic matter requires 
a tight control of the ATP/NADPH ratio which, in other photosyn- 
thetic organisms, relies principally on a range of plastid-localized 
ATP generating processes**. Here we show that diatoms regulate 
ATP/NADPH through extensive energetic exchanges between plas- 
tids and mitochondria. This interaction comprises the re-routing of 
reducing power generated in the plastid towards mitochondria and 
the import of mitochondrial ATP into the plastid, and is mandatory 
for optimized carbon fixation and growth. We propose that the 
process may have contributed to the ecological success of diatoms 
in the ocean. 

In oxygenic photosynthesis, light drives a linear electron flow from 
water to NADPH by the two photosystems (PS I and PS II), and the 
generation of an electrochemical proton gradient (or proton motive 
force, PMF) across the thylakoid membranes, which fuels ATP syn- 
thesis by an ATP synthase. Although the ratio of ATP/NADPH gen- 
erated by linear electron flow is not entirely resolved”*, it is considered 
to be insufficient to fuel CO, import into the plastid and assimilation 
by the Calvin cycle*’. Therefore, to make up the shortfall, additional 
ATP must be produced by alternative pathways that do not generate 
NADPH. In Viridiplantae (including green algae and_ higher 
plants) these alternative electron pathways have been found in the 
chloroplast and mostly comprise cyclic electron flow (CEF) around 
PS I (ref. 3) and/or the water-to-water cycles”; that is, flows of elec- 
trons resulting from the oxidation of water at PS II that are re-routed to 
an oxidase activity. This last group of oxidases include the Mehler 
reaction at the PS I acceptor side*’"”’, the activity of the plastoquinone 
terminal oxidase downstream of PS II (ref. 5) and the oxygenase activ- 
ity of ribulose 1,5-bisphosphate carboxylase-oxygenase (Rubisco) 
(photorespiration®). Although genes encoding most components for 
these processes appear to be present in diatoms’, it is currently 
unknown what mechanisms are used to balance the ATP/NADPH 
ratio. We therefore investigated this question using the model species 
Phaeodactylum tricornutum. 

The PMF generated across thylakoid membranes comprises an elec- 
tric field (AY) and a proton gradient (ApH). The A¥ can be probed 


in vivo by measuring the electro-chromic shift (ECS), that is the Stark 
effect, a modification of the absorption spectrum of specific pigments 
caused by changes in the transmembrane electric field in the plastid’’. 
An ECS signal is present in P. tricornutum (Fig. 1a), and an analysis of 
the ECS signal relaxation after light exposure (Extended Data Fig. 1) 
reveals that it comprises two components displaying different spectra 
(Fig. 1a). One follows a linear dependence on the amplitude of AY 
whereas the other follows a quadratic relationship (Fig. 1b). The exist- 
ence of a ‘quadratic ECS’ is predicted by theory’® but has only been 
observed so far in mutants of green algae with altered pigment 
composition’. The peculiar existence of two different ECS probes in 
wild-type P. tricornutum cells allows an absolute quantification of the 
electric field, providing a valuable tool to analyse the PMF in a living 
cell (see Methods). 

We plotted the amplitude of the quadratic versus linear ECS signals 
during the relaxation of a light-induced PMF and obtained a parabolic 
function (Fig. 1c, d and Extended Data Fig. 2). However, the ECS 
signals did not reach the minimum of the parabola in the dark, but 
rather remained positive. This indicates that a PMF is maintained 
across the thylakoid membrane of diatoms in the dark (AY, 
Fig. 1c). The field AY can not only be dissipated by addition of an 
uncoupler (carbonyl cyanide-4-(trifluoromethoxy)phenylhydrazone 
(FCCP)), but also by blocking mitochondrial electron transport by 
anaerobiosis or addition of antimycin A (AA) plus salicylhydroxamic 
acid (SHAM). The combination of these two inhibitors blocks both the 
cyanide-sensitive (complex III) and the cyanide-insensitive (alterna- 
tive oxidase, AOX) respiratory pathways (Fig. 1d). These results sug- 
gest that the residual PMF in the dark is generated in plastids by the 
chloroplast ATPase by hydrolysis of ATP derived from mitochondria 
(Fig. le)'*. Furthermore, the extent of A Yq observed in P. tricornutum 
is larger than that previously reported in green algae’’, suggesting that 
the ATP exchange could be more efficient in diatoms. 

To evaluate what mechanism regulates ATP/NADPH in the light in 
P. tricornutum, we first used the linear ECS to probe the CEF capacity 
(see Methods). CEF turned out to represent only a very low fraction 
of the maximum electron flow capacity (Fig. 2a and Extended Data 
Fig. 3a, c) and was insensitive to changes in the photosynthetic flux 
(Fig. 2a). Thus, it appears very unlikely that CEF could regulate ATP/ 
NADPH fluxes. Next we explored the water-to-water cycle using 
membrane-inlet mass spectrometry on cells incubated with 189,. O» 
consumption increased with light, being ~2.5-fold higher at saturating 
light intensities than in the dark (Extended Data Fig. 3b, d). We also 
found that the light-stimulated O, consumption was blocked by 
3-(3,4-dichlorophenyl)-1,1-dimethyl-urea (DCMU), which inhibits 
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O, production by PS II (Extended Data Fig. 3b, d), indicating that this 
process is fed by electrons generated by PS II. O2 consumption 
increased linearly with O, production, in agreement with earlier find- 
ings in another diatom species”, indicating that a constant proportion 
(~10%) of the electron flow from photosynthesis is re-routed to an 
O,-consuming pathway, regardless of light intensity (Fig. 2b). 


LETTER 


Figure 1 | ECS measures the PMF in 

P. tricornutum. a, Deconvolution of the 
experimental ECS spectrum (black) into linear 
(blue) and quadratic (red) spectral components 
(see Methods); r.u., relative units. b, Schematic 
representation of polar (blue) and polarizable (red) 
pigments, and their associated linear (blue) and 
quadratic (red) ECS responses to the electric field. 
Black: thylakoid lipid bilayer. Green ‘+’ and ‘—’ 
symbols: AY. Red arrows: pigment polarization 
induced by AY’. ¢, d, Relationship between 
quadratic and linear ECS in control (c) and in cells 
treated with uncoupler (8 4M FCCP, black 
squares), anaerobiosis (red circles) and respiratory 
inhibitors (AA, 5 uM, and SHAM, 1 mM; blue 
circles) (d). Green arrow: extent of the dark electric 
field (AY). Representative of five (c) and three 
(d) independent biological samples. e, Schematic 
representation of the energetic interactions 
between plastid (left) and mitochondria (right) in 
the dark. Red arrows: respiratory electron flows. 
Green dashed line: putative ATP/ADP exchange 
pathway between the organelles. ATPase, ATPase/ 
synthase; bef, cytochrome bef; I/III/IV, respiratory 
complexes I, III and IV. 


To test whether the O,-consuming pathway occurs in the plastid or 
relies on mitochondrial activity, we used increasing concentrations of 
inhibitors to titrate respiration and tested possible consequences on 
photosynthesis. We reasoned that if respiration consumes reducing 
equivalents generated in the plastid to generate additional ATP, any 
mitochondrial dysfunction should negatively impact photosynthesis. 
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Figure 2 | Energetic interactions between 
mitochondria and plastid in P. tricornutum. 

a, Relationship between CEF capacity and total 
electron flow (TEF, mean + s.d. from data in 
Extended Data Fig. 3a, c). b, Relationship between 
oxygen uptake (Up) and gross photosynthesis 

(Eo) as measured by membrane-inlet mass 
spectrometry (mean + s.e.m. from data in 
Extended Data Fig. 3b, d). c, Dependency of 
photosynthetic activity (ETRps y) on respiration 
rates (as the percentage of control, mean = s.d. 
from data in Extended Data Fig. 4). Closed circles: 
SHAM + AA; open circles: SHAM + myxothiazol 
treatments (see Methods). d, Schematic 
representation of possible plastid-mitochondria 
metabolic interactions in the light. Continuous and 
dashed blue arrows: photosynthetic linear and 
cyclic flows, respectively. Yellow arrow: exchange 
of reducing equivalents. For other symbols, see text 
and Fig. le. 
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Figure 3 | Phenotypic traits of AOX knockdown 
lines of P. tricornutum. a, Relative sensitivity 

of photosynthesis (ETRps 1) to addition of 
respiratory inhibitors: AA (magenta), SHAM (dark 
yellow) and AA + SHAM (black) (n = 3 + S.D), 
or to knockdown of AOX (n = 5 +s.d.). Blue 

and red: kd-c5 and kd-c9, respectively. b, Western 
blot analysis of photosynthetic and respiratory 
complexes. c, Growth rates of the wild-type and 
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We found that this was indeed the case, as photosynthetic electron 
transfer rate (ETRps 11) linearly followed changes in respiration (Fig. 2c 
and Extended Data Fig. 4). We conclude that a partial re-routing of the 
photosynthetic flow towards mitochondrial respiration rather than 
CEF optimizes photosynthesis in diatoms, providing commensurate 
ATP per NADPH at all irradiances (Fig. 2d). 

Photosynthetic activity displayed increasing sensitivity to AOX 
inhibition with light, suggesting that cyanide-insensitive respiration 
becomes prominent in high light (Fig. 3a). This prompted us to gen- 
erate AOX knockdown cell lines of P. tricornutum. Two independent 
clones were selected on the basis of reduced AOX protein accumula- 
tion (Fig. 3b) and decreased activity (measured as the SHAM-sensitive, 
AA-insensitive component of respiration; Extended Data Fig. 5a). The 
AOX contribution, representing ~50% of dark respiration in wild- 
type cells (Extended Data Fig. 4e), was decreased two-fold in the two 
knockdown lines (Extended Data Fig. 5a). Confocal microscopy con- 
firmed the mitochondrial localization of the targeted gene product 
(Extended Data Fig. 6a). The reduced AOX activity in the knockdown 
lines paralleled a diminished PMF in the dark (AY, Extended Data 


Thalassiosira weissflogii Thalassiosira pseudonana 


100 


Irradiance (mol quanta m-? s~') 


200 300 400 


AOX lines (n = 7 + s.d.). d, In vivo light 
dependency of NADPH redox state in wild-type 
and AOX lines. Data are normalized to the 
maximum value in the light. e, In vivo 31D_NMR 
evaluation of the NTP content in wild-type and 
AOX knockdown lines, in the dark or in low light, 
with or without AA (data normalized to the 

dark values). d, e, Three independent biological 
samples. 
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Fig. 5b), despite the fact that overall dark respiration was slightly 
higher (Extended Data Fig. 5a). This effect was strongly enhanced 
by addition of AA (Extended Data Fig. 5b). The decreased AOX activ- 
ity also correlated with a decreased photosynthetic capacity, especially 
under high light intensities (similar to SHAM-treated wild-type cells; 
Fig. 3a), and a diminished growth rate (Fig. 3c), which was exacerbated 
further by inhibiting complex III using AA (Extended Data Fig. 5c). 
The growth and photosynthetic phenotypes were not due to changes in 
the accumulation of the photosynthetic complexes, for which we 
detected comparable levels of representative proteins in all cell lines 
(Fig. 3b). The only exception was a small decrease in the cytochrome 
bef content in the knockdown cell lines, which nonetheless did not 
decrease its overall catalytic turnover (Extended Data Fig. 7). 

Our working model presented in Fig. 2d predicts that disruption 
of the plastid-mitochondria interaction in the knockdown cell lines 
should lead to the accumulation of NADPH and a decreased cellular 
content of ATP in the light. Indeed, in vivo assessments of the pools 
of NADPH and ATP in wild-type and knockdown cell lines con- 
firmed an increase in the NADPH/NADP* ratio with light intensity 
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Figure 4| ATP transfer from mitochondria to plastid in representative 
diatoms. a, Spectra of the linear (blue) and quadratic (red) ECS probes in four 
diatoms. Blue and red vertical dashed lines represent the wavelengths used for 
linear and quadratic ECS, respectively. Spectra are normalized to 1 at the 
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maximum value of the linear ECS. b, Relationship between the quadratic and 
the linear ECS in control conditions (open symbols) and in AA + SHAM 
conditions (filled symbols). A Yq is represented as a horizontal arrow. Data are 
representative of three independent biological samples. 
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(Fig. 3d and Extended Data Fig. 8a), accompanied by a net decrease 
of cellular ATP levels (Fig. 3e and Extended Data Fig. 8b), both of 
which were more drastic in AOX knockdown cells than wild-type 
cells. These observations confirm that mitochondrial respiration is 
directly involved in the adjustment of the ATP to NADPH ratio in 
the plastid. 

We then examined the generality of our findings in other diatom 
species. The similar ECS features (linear and quadratic components) in 
Thalassiosira pseudonana, Thalassiosira weissflogii, Fragilaria pinnata 
and Ditylum brightwellii (Fig. 4a) were used to confirm the presence ofa 
PMF in the plastids in the dark at the expense of hydrolysis of ATP 
supplied by the mitochondria in all cases (Fig. 4b). Moreover, a neg- 
ligible contribution of CEF (Extended Data Fig. 9) and a significant 
involvement of mitochondrial respiration to photosynthesis (Extended 
Data Fig. 10) were found in all these species. The involvement of mito- 
chondrial respiration in the optimization of photosynthesis therefore 
appears to be a general and conserved feature in diatoms. 

We conclude that ATP generation sets the rate of photosynthetic 
carbon assimilation in diatoms, as suggested in other photosynthetic 
organisms*, but that in contrast with the Viridiplantae, optimization 
of diatom photosynthesis does not rely on plastid-localized processes. 
Instead, constitutive energetic interactions between diatom mito- 
chondria and plastids ensure the sharing of reducing equivalents 
and ATP to fuel CO) assimilation in the light (Fig. 2d). While the 
process we have uncovered has some similarities to the export of 
reducing equivalents from the plastids towards mitochondria in 
plants and green algae, the fundamental difference is that in plants 
and green algae the process serves as a valve to dissipate excess of 
reducing power”, and can only participate in the regulation of the 
ATP/NADPH ratio when the chloroplast capacity to make extra ATP 
is genetically disrupted****. We propose that the presence of trans- 
porters such as the malate shuttle’', which are encoded in diatom 
genomes”, as well as the very tight physical interactions observed in 
diatoms between plastids and mitochondria (Extended Data Fig. 6b), 
may make these energetic interactions possible between the two 
organelles. Because diatom plastids are surrounded by four mem- 
branes rather than two as in Viridiplantae’’, it will be of interest to 
elucidate the configuration of such transporters, as well as the ration- 
ale for the diminished CEF efficiency in diatoms. More generally, the 
coupling of respiratory and photosynthetic activities in diatoms 
should be explored in the context of resource utilization in the ocean 
and as a means to boost the production of useful metabolites for 
biotechnology. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized. The investigators were not blinded to allocation during 
experiments and outcome assessment. 

Growth conditions. Wild-type and AOX transformant lines of P. tricornutum 
Ptl 8.6 (CCMP 2561) were grown in artificial sea water (ASW”*). T. pseudonana 
(CCMP 1335), T. weissflogii (CCMP 1336), F. pinnata (CCAP 1029/2) and 
D. brightwellii (CCMP 359) were grown in F/2 medium, supplemented 
with silicate**. All strains were grown at 19+1°C in semi-continuous batch 
culture (with moderate shaking for P. tricornutum, T. pseudonana and 
T. weissflogii). The photoperiod was 12 h light/12 h dark, and light irradiance 
was 70 mol quantam * s *. Cell concentration was determined daily with a 
Z2 Coulter Counter analyser (Beckman Coulter) to ensure all the experiments 
were performed with cells in exponential phase. For biophysical measurements, 
cells were concentrated by centrifugation and resuspended in their growth med- 
ium (supplemented with 10% w/v Ficoll to prevent cell sedimentation) and kept 
in the dark at least 30 min before measurements. 

Inhibitors. DCMU, 2,5-dibromo-3-methyl-6-isopropyl-p-benzoquinone (DBMIB), 
FCCP, AA, myxothiazol and SHAM (Sigma-Aldrich) were dissolved in ethanol, 
whereas hydroxylamine, glucose, glucose oxidase and catalase (Sigma-Aldrich) 
were dissolved in deionized water. FCCP was used at a very low concentration 
(8 1M, Fig. 1d) to allow the disruption of the dark PMF without preventing the 
light-induced generation of PMF needed to quantify AY. AA and myxothiazol 
were used at 5 1M, unless otherwise stated. DCMU was used at a concentration of 
15 uM. In every measurement involving hydroxylamine or SHAM, the lowest 
inhibitor concentration to induce a full inhibition of PS II activity or maximum 
inhibition of respiration, respectively, was used. The range of concentrations 
used was 30-100 uM and 0.5-1 mM for hydroxylamine and SHAM, respec- 
tively. Anaerobic conditions were obtained through incubation with catalase 
(1,000 U ml’), glucose (10 mM) and glucose oxidase (20,000 U ml '). AA has 
been previously described as an inhibitor of cyclic electron flow, affecting the 
NDH-independent pathway”. This potential effect was ruled out in diatoms 
because no change in cyclic electron flow was noticed upon addition of AA. 
Consistent with the fact that genes encoding some members of the NDH complex 
are absent in diatom genomes, this indicates that AA does not affect the NDH- 
independent CEF pathway in diatoms. AA and myxothiazol were preferred to 
potassium cyanide to block the cyanide sensitive pathway of respiration because 
potassium cyanide also affects Rubisco activity”, ascorbate peroxidase” and Cu/ 
Zn superoxide dismutase”. 

Deconvolution of linear and quadratic ECS components. To deconvolute the 
linear and quadratic contributions to the ECS signals, cells were left for an hour in 
the cuvette to reach complete anaerobiosis. In these conditions, the ATP synthase 
activity is slowed down”, and long-living ECS signals are no longer contaminated 
with other light-induced absorption changes (principally associated with c-type 
cytochromes). Light stimulation of cells was achieved with a series of six laser 
single-turnover (duration ~7 ns) saturating flashes, provided by a laser dye (LDS 
698) pumped by a frequency doubled Nd-YAG laser (Quantel). We considered 
that the relaxation of the electric field generated by the light stimulus is described 
by the exponential function AY = AY, exp(—t/t), where t is time, AY is the 
initial electric field generated by the light, and 7 is the electric-field decay lifetime. 
The linear and quadratic components of the ECS are theoretically proportional 
to AW and AY”, respectively. Therefore, the AI/I spectro-temporal matrices 
(from 460 to 600 nm) can be described by a sum of two exponentials: 
y(A,t) = A(A) exp (— t/t) + B(A) exp ((—2t/t)) + C(2).. The kinetics of ECS 
relaxation were fitted by a global routine, which considers the lifetime t as a global 
(wavelength-independent) variable, and the amplitudes of linear and quadratic 
components (A and B, respectively) as local (wavelength-dependent) variables. A 
non-decaying component (C) was also included in the fit to account for a small 
fraction of residual signal at long delay times. The plot of the A and B amplitudes as 
a function of the wavelength provides the decay associated spectra of the linear and 
quadratic contributions to the ECS signal, respectively, which are shown in Fig. la. 
The fit was performed with homemade software, which used the MINUIT pack- 
age, developed and distributed by CERN (Geneva, Switzerland) and implemented 
in FORTRAN77. It minimizes the reduced sum of squared residues between the 
model function and the experimental data, employing a two-step protocol invol- 
ving an initial search that utilizes the Simplex method (Nelder-Mead algorithm) 
and a refined search using the Levenberg-Marquardt algorithm as described in ref. 
32. The quality of the fit description was judged on the basis of reduced sum of 
squared residues statistics, visual inspection of the fit residuals, residuals autocor- 
relation and stability of the solutions upon random perturbation of the best-fit. 
The deconvolution was performed on two independent biological samples, giving 
similar results. 


Measurements of c-type cytochromes and linear and quadratic ECS. 
Absorption difference signals were measured at different wavelengths with a 
Joliot-type spectrophotometer (JTS-10, Biologic), equipped with a white probing 
LED and the appropriate interference filters (3-8nm bandwidth). For AY 
measurements, the PMF was increased using an ~10ms pulse of saturating 
(4,500 umol quanta m “s !) red light (see Extended Data Fig. 3 for represent- 
ative ECS kinetics). For P. tricornutum, ECS signals were evaluated using three 
wavelengths, to eliminate contribution from c-type cytochromes (see Extended 
Data Fig. 2). The latter was calculated as cyt c= [554] — 0.4[520] — 0.4[566], 
where [554], [520] and [566] are the absorption difference signals at 554 nm, 520 
nm and 566 nm, respectively. The very similar relaxation of c-type cytochromes in 
aerobic and anaerobic conditions, despite very different ECS relaxations 
(Extended Data Fig. 2b), demonstrates the validity of the cyt c deconvolution 
procedure. Then, ecsj, and ecsguaa (ECS signals before correction for AY) 
were estimated from the following relationships: ecsjj, = [520] — 0.25cyt c and 
€CSguaa = [566] + 0.15cyt c. For the other diatoms, appropriate wavelengths were 
chosen for calculating ecsj;, and ecsguaq (red and blue lines in Fig. 4) to minimize 
the cytochrome c contributions. The relationships between ecsguaq and ecsjin were 
fitted with the parabolic equation ecsguaq + aA W? =alecs;, + AY)’, where 
AY, is the electrical component of the PMF in the dark and a is constant for all 
the conditions in a diatom species (see Extended Data Fig. 2c). The ecsj, and 
€CSguaq Values represent ECS changes relative to dark values. Therefore, we cor- 
rected them for the dark electric field. This leads to ECS}, = ecsjin + AW, and 
ECSguad = €CSquaa + aA W,’, namely absolute values of the ECS signals. This sim- 
ply corresponds to a shift of the x- and y-axes to allow the minimum of the 
parabola to coincide with the origin of the axes (see Extended Data Fig. 2d), 
and gives ECSguad = aECSjin’. This leads to the evaluation of AY as the minimal 
ECS), value of the experimental data. In Fig. 1c, d, and Extended Data Fig. 2c, d, 
ECS data were normalized to the ecs),, increase upon a saturating laser flash (that 
is, one charge separation per photosystem, see ref. 33). This allows the estimation 
of AY, in P. tricornutum as approximately five charge separations by PS, namely 
~100 mV (ref. 12) (Fig. 1c). The presence of linear and quadratic ECS components 
allows measurement of the absolute value of AY in the dark (A¥Y,). Indeed the 
amplitude of the linear ECS response (ecsj,) observed upon a light stimulus 
increasing AY is constant, namely independent of the value of the field pre- 
existing the illumination (AY). Conversely, the amplitude of the quadratic 
ECS response (ecsguaa) is a function of the value of AYq. Therefore, plotting the 
amplitude of the ecsguaa Versus ecs};, allows quantification of the absolute value of 
the electric field in the dark (Aq). 

Cyt bef turnover was measured through the slow phase (phase b*’) of the linear 
ECS, which reflects bgf-catalysed charge transfer across the membranes, and 
through the reduction rate of the c-type cytochromes (c¢/f), using the three-wave- 
lengths deconvolution procedure described above. Measurements were performed 
after a saturating laser flash. 

Measurements of photosynthetic flows. For calculation of the TEF (the sum of 
linear and cyclic electron flows) and CEF capacities, we measured the photoche- 
mical rates in the absence and presence, respectively, of DCMU. In brief, under 
steady-state illumination conditions, the ECS signal results from concomitant 
transmembrane potential generation by PS II, the cytochrome bef complex and 
PS I, and from transmembrane potential dissipation by the plastid ATP synthase. 
When light is switched off, PS activities stop immediately, while ATP synthase and 
cytochrome bef complex activities remain (transiently) unchanged. Therefore, the 
difference between the slopes of the linear ECS signal (ECS) measured in the 
light and after the light is switched off (Sp — S,) is proportional to the rate of PSI 
and PS II photochemistry (that is, to the rate of ‘total’ electron flow, Extended Data 
Fig. 3a). Because the linear ECS has been normalized to the amplitude of the linear 
ECS signal induced by a saturating laser flash** (see above), the difference of slopes 
evaluates the number of charge separations per photosystem and per second. The 
rate of CEF can be evaluated using the same approach under conditions where PS 
II activity is inhibited by DCMU, and dividing this slope by the linear ECS signal 
induced by a saturating laser flash in the presence of PS II inhibitors (one charge 
separation per PS I (ref. 34)). This was done using saturating concentrations of 
DCMU, which block PS II oxidation, and of hydroxylamine, to avoid charge 
recombination within PS II. 

Fluorescence-based measurements. Fluorescence-based photosynthetic para- 
meters were measured with a fluorescence imaging setup described in ref. 35. 
Photosynthetic electron transfer rate ETRps y, and NPQ were calculated, respect- 
ively, as I(Fyy’ — F)/Fm’ and (Fm — Fm')/Fm', where F and Fy’ are the steady-state 
and maximum fluorescence intensities in light-acclimated cells (respectively), 
F,, is the maximal fluorescence intensity in dark-adapted cells, and I is 
the light irradiance in pmolquantam™’s~' (refs 36, 37). In Fig. 2c and 
Extended Data Fig. 4a, d, the light irradiance is 30 mol quanta m ~* s_'. The 
light saturation curves of ETRps 1; were fitted with the exponential rise function 
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P= Pyax(1 — exp(E/E,)), where Prax is the maximal photosynthetic electron 
transport rate and E is the optimal light. AETR/ETR (Fig. 3a) was calculated as 
(ETRrer — ETR) X 100/ETR, cs the reference being the value measured in wild-type 
cells in untreated conditions. 
Membrane-inlet mass spectrometry measurements. Samples were introduced 
in a 3 ml thermostated cuvette, which was connected to a Quadrupole Mass 
Spectrometer (QMS 200, Pfeiffer Vacuum Prisma) by a stainless steel vacuum 
tube (0.125 inches) passing through a water trap filled with ethanol and dry ice. 
The sample was separated from the tube via a gas-permeable inlet system (poly- 
tetrafluoroethylene (PTFE) membrane). 180, was added as a bubble to the algal 
suspension, and the bubble was removed before the experiment. The measure- 
ments of the partial pressures of 6, (p'°O., mlz = 32), °O, (p'*0., m/z = 36) 
and argon (m/z = 40) were performed after the cuvette was sealed. A blue light- 
emitting diode (LED) source was connected to the cuvette, and the light irradiance 
was manually adjustable in the 0 to ~800 pmol quantam™~*s~' range. The tem- 
perature was kept at 19 + 1 °C in the cuvette during the experiment. 

To calculate gross O2 production (Eo) and uptake (Uo), respectively production 
and consumption by the cells, we adapted the equations from ref. 38: 


=(Al BO 0] /At+k[8O op«'*o] [oO o})/[%o [oO 


9 = (A[°O] /At + k['°O]) + Up(([80} + ['°O})/['8O)) 


where k is the rate constant of O, decrease measured in the absence of algae. We 
normalized Oz to Argon (a biologically inert gas with very similar solubility 
properties), which decreases the sensitivity of O2 measurements to fluctuations 
by ~80% (ref. 39). The gas concentrations were calibrated by measuring the air- 
equilibrated O, concentration (stirring deionized water in the open cuvette for at 
least 5 h) and background O, (bubbling with N,). 

Respiration rates. Respiration rates were measured as O, exchange rates using a 
Clark-type oxygen electrode at 19 °C (Hansatech Instruments). AOX capacity was 
measured as SHAM-sensitive respiration in conditions where the cyanide-sens- 
itive pathway was inhibited beforehand (AA, 5 uM). 

ATP/NADPH in vivo measurements. NADP'/NADPH redox changes were 
followed in living cells using a Dual-PAM (Walz). NADPH fluorescence was 
measured at 460 nm, upon excitation in the near ultraviolet. Chlorophyll a con- 
centration was ~5 pg ml’. ATP content was measured using an in vivo °'P-AMX 
400 NMR spectrometer equipped with a 25-mm multinuclear probe tuned at 161.9 
MHz, and a homemade lighting system, as described in ref. 40. The relative ATP 
content was estimated in vivo from the surface of «-, B- and y-phosphorus res- 
onance peaks corresponding to the three phosphates of NTPs, which dominate the 
NMR spectra with inorganic phosphate and polyphosphates’. 

Western blots and immunolocalization. Protein samples (5-10 jig) were loaded 
on 13% SDS-polyacrylamide gel electrophoresis (PAGE) gels and blotted to nitro- 
cellulose. Primary AOX antibody was custom designed (Sdix, 1:4,000 dilution). All 
other antisera used were obtained from Agrisera (http://www.agrisera.com/en/ 
info/catalog.html). The blots were developed with ECL detection reagent and 
images of the blots were obtained using a CCD (charge-coupled device) imager 
(Chemidock MP Imaging, Bio-Rad). The results presented in Fig. 3b are repres- 
entative of a total of five western blots on independent biological samples. 
Immunolocalization of AOX was generally done as described in ref. 34. Briefly, 
cells were fixed with 2% formaldehyde in culture media for 20 min, washed three 
times with marine phosphate buffer (mPBS, see ref. 42) and permeabilized by 1% 
Triton X-100 in mPBS for 10 min. The cells were washed again, blocked for 30 min 
in 1% BSA in mPBS, and incubated overnight at room temperature (25 °C) with 
anti-AOX antibody from rabbit (custom design, Sdix, 1:200 dilution in mPBS). 
The cells were then rinsed with mPBS and incubated with donkey Alexa 488- 
conjugated anti-rabbit IgG antibody (Life Technologies, at 1:100 dilution in 
mPBS) for 2 h at room temperature. Cells were then stained with 0.5 ng ml * 
DAPI (4’,6’-diamidino-2-phenylindole, Life Technologies) for 10 min and 
mounted with Vectashield (Vector Laboratories) after a rinse. Finally the cells 
were observed using a Leica SP5 confocal microscope (Leica Microsystems). 
Electron microscopy. For transmission electron microscopy, P. tricornutum 
cells were fixed in 0.1 M cacodylate buffer (Sigma-Aldrich), pH 7.4, containing 
2.5% glutaraldehyde (TAAB), 2% formaldehyde (Polysciences) for 1 h at room 
temperature and then prepared according to a modified protocol from T. J. 
Deerinck et al. (http://ncmir.ucsd.edu/sbfsem-protocol.pdf). After the dehyd- 
ration steps, the cells were infiltrated with ethanol/Epon resin mixture (2/3-1/3 
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for 1 h and 1/3-2/3 for 1 h) and finally embedded in Epon in a 60 °C oven for 48 h 
or longer. Ultrathin sections (60 nm) were prepared with a diamond knife on an 
UC6 Leica ultramicrotome and collected on 200 1M mesh nickel grids before 
examining on a JEOL 1200 EX electron microscope. 

AOX knockdown lines. Partial coding sequence of AOX (identity Phatr2_ 
bd1075) was amplified using primers AOXcFwd (TGCTCCGGAGGACAAT 
GAATTCGC) and AOXcRev (TGGTCTAGACGTCGCGATGTTC), cut by 
EcoRI/Xbal (Fermentas) and cloned into an EcoRI/Xbal-digested antisense con- 
struct*’, under control of the FepB promoter. The AOX antisense construct 
was introduced into wild-type P. tricornutum cells using the standard micropar- 
ticle bombardment procedure with a PDS-1000/He Particle Delivery System 
(Bio-Rad)**“*. After 2 days, cells were transferred to F/2-supplemented-filtered 
seawater plates containing 100 1g ml phleomycin, a selective antibiotic for the 
Sh ble gene (InvivoGen, 09K30-M™M). Putative antisense lines were verified for the 
presence of the transformed DNA by PCR screening using the primers ShbleFwd 
(ACCAGTGCCGTTCCGGTG) and ShbleRev (TCGGTCAGTCCTGCTCCTO), 
and the most strongly silenced lines were chosen on the basis of reduction of AOX 
protein levels as measured by SDS-PAGE western blot with an anti- AOX antibody 
(see section on western blots and immunolocalization). 
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Extended Data Figure 1 | Deconvolution of the quadratic and linear ECS 
in P. tricornutum. a, Absorption difference (AI/I) kinetics followed at 
different wavelengths in P. tricornutum, after a series of six saturating laser 
flashes, in anaerobic conditions. Solid lines correspond to the global fit of the 
experimental data with a sum of two exponential decays, with time constants 
t and 2t, respectively, as expected for linear and quadratic dependencies 
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(see Methods). b, AJ/I spectra are shown at different times during ECS 
relaxation. All spectra were normalized to 1 at 520 nm for better comparison. 
The observation that the blue and green parts of the spectrum are homothetic 
during relaxation, while changes are seen in the red most part of it, reflects 
the presence of the two ECS components, having different relaxation kinetics. 
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Extended Data Figure 2 | Separation of c-type cytochrome signals from 
linear and quadratic ECS signals in P. tricornutum. a, Kinetics of AI/I 
changes at 520, 554 and 566 nm during an ~10 ms pulse of saturating red light 
(4,500 pmol quanta ms _') and the subsequent dark relaxation (top: control 
conditions; bottom: AA + SHAM). b, Kinetics of ecsjin, €CSguaa changes and 
c-type cytochrome redox state, from kinetics in a, after deconvolution as 
explained in Methods. c, d, Relationship between the quadratic and the linear 
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ECS, before (ecsyjn, €CSguaa ¢) and after (ECSiin, ECSguaa d) correction for 
the dark electric field (see Methods). Dark yellow and magenta symbols 
correspond to control and AA + SHAM conditions, respectively. The green 
arrow indicates the value of AY, in control conditions. Data are representative 
of five independent biological samples. The black boxes in a and b indicate 


periods of darkness. 
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Extended Data Figure 3 | Cyclic electron flow and water-to-water cycles in 
P. tricornutum. a, Representative traces of changes in ECS, (normalized as 
explained in Methods, namely expressed in charge separation per PS) to 
evaluate linear and cyclic electron flow. Cells were illuminated with 1,870 pmol 
quanta m *s_! of red light, in absence (filled circles) and presence (open 
circles) of DCMU and then transferred to the dark; r.u., relative units. 

b, Representative traces of the '°O, and 180, concentrations at the offset of a 
280 umol quanta m 7s ! blue light; a.u., arbitrary units. In a and b, light 
and dark periods are represented by white and black boxes, respectively. 


Irradiance (umol quanta.m”.s”') 


c, Photochemical rate corresponding to total electron flow (TEF, dark symbols, 
data from four independent biological samples) and CEF (red symbols, 
n=8+s.d.) at different irradiances. TEF and CEF were estimated from the 
initial slope of the ECS decay, as (Sp — S, control) and (Sp — S; DCMU), 
respectively (see Methods). d, Light-dependencies of oxygen uptake (Up, 
half-filled symbols) and gross photosynthesis (Ej, open symbols) in control 
conditions (dark) and in the presence of DCMU (red). Data from two 
independent biological samples (squares and circles). 
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Extended Data Figure 4 | A ¥% and photosynthesis under respiratory 
inhibition in P. tricornutum. a-c, Dependency of the ETRps y (a), AWa 

(b) and dark respiration (c) upon inhibition of the cyanide-sensitive respiratory 
pathway with different concentrations of antimycin A, in the presence of 
saturating SHAM (1 mM). Data from two independent biological samples. 
Experimental data were fitted with a mono-exponential decay function. 
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d, e, ETRps 1 (d), AYq and dark respiration (e), expressed as percentage of 
control, in the presence of saturating AA (5 1M), SHAM (1 mM) or 

AA + SHAM (four independent experiments = s.d.). f, Relationship between 
A and mitochondrial respiration in samples treated with increasing 
concentrations of AA in the presence of 1 mM SHAM (mean value + s.e.m. 
from b and c). 
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Extended Data Figure 5 | Dark respiration, PMF and growth in AOX conditions (green), in the presence of AA (dark green) and in the presence of 
knockdown lines of P. tricornutum. a, Respiratory activity of wild-type and AA + SHAM (white). c, Growth curves of wild-type and AOX knockdown 
AOX knockdown lines. Total respiration rate (red bars) and the contribution of __ lines in the presence/absence of AA (2 1M). Three independent growth curves 


the AOX capacity (white bars, see Methods) were normalized to wild-type are shown for each strain/condition. AA was added every day and cells were 
values (n = 5 + s.d.). b, ECS-based measurements of AYq in wild-type grown in continuous light to prevent them from dying in the dark because 
(n= 3 +s.d.) and AOX knockdown lines (n = 2 + s.e.m.), in control of lack of respiration. 
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Extended Data Figure 6 | Subcellular localization of AOX in P. tricornutum __ localization is highly similar to that observed with the mitochondria-specific 
and plastid-mitochondria interaction in P. tricornutum wild-type cells. mito-tracker probe (data not shown). Images are representative of 60 cells 

a, Subcellular localization of AOX. Cells were treated with an anti-AOX from three independent biological samples. b, Electron micrographs of the 
antibody and then with a secondary Alexa Fluor 488 antibody (see Methods). _ plastid-mitochondria juxtaposition in P. tricornutum. Arrows indicate possible 
Positions of plastid and nuclei are indicated by chlorophyll a autofluorescence _ physical contacts between the plastid and mitochondrial membranes. Image is 
(red) and DAPI staining (blue), respectively. The pattern of AOX representative of 51 images from seven independent biological samples. 
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Extended Data Figure 7 | Cytochrome b,f turnover in P. tricornutum given 10 s apart. Data were normalized to the amplitude of the fast phase of the 
wild-type and AOX knockdown lines. a, Schematic representation of the ECSjin signal. Cytochrome c and ECS), were deconvoluted as explained in 
electron-flow reaction steps in the cytochrome bef complex, which can be Methods. Three independent biological samples are shown in red, blue and 


evaluated by spectroscopic measurements. b, Slow phase of ECS, indicating green colours. Cell concentration was 2 X 10’ cells per millilitre. Note that both 
cytochrome bg activity (top) and time-resolved redox changes of cytochromes __ the slow phase of the ECS, and the reduction of cytochromes c¢/f were 

C6/f (bottom) in wild-type and AOX knockdown lines (kd-c5 and kd-c9). completely abolished by the plastoquinone competitive inhibitor DBMIB 

P. tricornutum cells were exposed to saturating single-turnover laser flashes (10 uM; black arrow). 
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Extended Data Figure 8 | In vivo changes in the NADPH redox state and AOX knockdown lines c5 (middle) and c9 (right) in the dark (red), light (green) 
ATP in wild-type and AOX knockdown lines. a, Changes in NADPH at and light + AA (blue) conditions are shown, with normalization to the internal 
different light intensities. Light and dark periods are represented by white and _ standard (methylenediphosphonate; pH 8.9). The positions of the o-, B- and 
black boxes, respectively. Light intensities were 50, 100, 200 and 400 mol y-phosphates of NTPs are shown. a, b, Representative of three independent 
quanta ms! (green, blue, red, and black traces, respectively). Chlorophyll biological samples. 


concentration was ~5 yg ml. b, Spectra from cells of wild-type (left) and 
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Extended Data Figure 9 | Cyclic electron flow in representative diatoms. (red, n = 4). Cyclic electron flow capacity was also measured for every species 
a, Total electron flow (filled symbols) was measured at different light (open circles, five independent biological samples + s.d.). b, CEF capacity was 


irradiances, as illustrated in Extended Data Fig. 3, in T. weissflogii (black,n =3 plotted against TEF. CEF and TEF are presented as mean values = s.d. from 
independent biological samples), T. pseudonana (blue, n= 2) and F. pinnata _a. The green line corresponds to CEF = 5% of the maximal total electron flow. 
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Extended Data Figure 10 | A % and photosynthesis under conditions of 
respiratory inhibition in representative diatoms. Dark respiration (a—d), 
AY, (e-h) and ETRps y (i-I) in untreated cells and after treatment with 
inhibitors of respiration antimycin A, and/or SHAM at saturating 
concentrations. a, e, i, T. weissflogii (black). b, f, j, T. pseudonana (blue). 
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Mechanism of phospho-ubiquitin-induced 


PARKIN activation 


Tobias Wauer', Michal Simicek!, Alexander Schubert! & David Komander'! 


The E3 ubiquitin ligase PARKIN (encoded by PARK2) and the 
protein kinase PINK1 (encoded by PARK6) are mutated in 
autosomal-recessive juvenile Parkinsonism (AR-JP) and work 
together in the disposal of damaged mitochondria by mito- 
phagy'’*. PINK] is stabilized on the outside of depolarized mito- 
chondria and phosphorylates polyubiquitin** as well as the 
PARKIN ubiquitin-like (Ubl) domain”’®. These phosphorylation 
events lead to PARKIN recruitment to mitochondria, and activa- 
tion by an unknown allosteric mechanism*'”. Here we present 
the crystal structure of Pediculus humanus PARKIN in complex 
with Ser65-phosphorylated ubiquitin (phosphoUb), revealing the 
molecular basis for PARKIN recruitment and activation. The 
phosphoUb binding site on PARKIN comprises a conserved phos- 
phate pocket and harbours residues mutated in patients with AR- 
JP. PhosphoUb binding leads to straightening of a helix in the 
RINGI1 domain, and the resulting conformational changes release 
the Ubl domain from the PARKIN core; this activates PARKIN. 
Moreover, phosphoUb-mediated Ubl release enhances Ubl phos- 
phorylation by PINK1, leading to conformational changes within 
the Ubl domain and stabilization of an open, active conformation 
of PARKIN. We redefine the role of the Ub] domain not only as an 
inhibitory’* but also as an activating element that is restrained in 
inactive PARKIN and released by phosphoUb. Our work opens up 
new avenues to identify small-molecule PARKIN activators. 


The RING-between-RING E3 ligase PARKIN contains a RING1 
domain that binds ubiquitin-charged E2 enzymes, and transfers 
ubiquitin from the E2 to an active site Cys residue in the RING2 
domain and subsequently to a substrate. Cytosolic PARKIN exists in 
an autoinhibited, ‘closed’ conformation’*"*, in which binding to E2 is 
blocked by the amino-terminal Ubl domain as well as by a ‘repressor’ 
element (REP), and access to the RING2 active site Cys is blocked by 
the unique PARKIN domain (UPD, also known as RINGO) (Extended 
Data Fig. 1). PhosphoUb binding and/or PARKIN Ubl phosphoryla- 
tion are presumed to induce conformational domain rearrangements to 
activate PARKIN’*"*; however, the mechanism and sequence of events 
are unclear. Once activated, PARKIN ubiquitinates numerous mito- 
chondrial and cytosolic proteins’, eventually triggering mitophagy. 

To understand how phosphoUb induces PARKIN activation, we 
used PINK1-phosphorylated ‘ubiquitin suicide probes’’* that can 
modify Cys residues near a ubiquitin binding site in vitro’’ (Fig. la 
and Extended Data Fig. 1d). Probes could not modify a previously 
crystallized construct of human PARKIN lacking the Ubl domain" 
(HsPARKIN(AUDI), amino acids 137-465) (Fig. 1b). Unexpectedly, 
a similar fragment of Pediculus humanus corporis (human body louse) 
PARKIN (amino acids 140-461, hereafter referred to as PhPARKIN) 
was modified by a subset of phosphoUb (pUb) suicide probes (Fig. 1b), 
enabling purification of the PhAPARKIN-pUb complex and determina- 
tion of a crystal structure at 2.6 A resolution (Fig. 1c, Methods and 


Figure 1 | Generation and structure of 
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The catalytic Cys in RING2, and key phosphoUb 
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comparison to HsPARKIN (Protein Data Bank 
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Extended Data Figs le and 2a). PhaPARKIN-pUb resembles autoin- 
hibited structures of HsPARKIN (Fig. 1c, d and Extended Data Fig. 2e), 
with key differences as described below. The phosphoUb suicide probe 
had modified Cys349 in the PhPARKIN in-between-RING (IBR) 
domain (Figs 1c and 2a), which in HsPARKIN corresponds to 
probe-unreactive Gln347. Notably, HsPARKIN Q347C is modified 
by phosphoUb suicide probes (Fig. 2b), indicating a similar binding 
mode of phosphoUb in HsPARKIN. Hence, our complex structure 
serves as model for phosphoUb binding to HsPARKIN. 

PhosphoUb forms an extended interface (1,150 A, 25% of ubiquitin 
surface) with the RING1 and IBR domains in PhPARKIN, and also 
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Figure 2 | PhosphoUb binding to PARKIN. a, PhosphoUb binding site on 
PhPARKIN as in Fig. 1c. Dotted lines indicate hydrogen bonds. The RING1 
B-hairpin that harbours patient mutations is highlighted in red. bb, backbone 
contacts. b, PhosphoUb suicide probe reactions as in Fig. 1b with UbC2Br and 
GST-PhPINK1. The experiment was performed three times with consistent 
results. c, Occupied Ser65 phosphate pocket in PhPARKIN (left); identical 
pocket in HsPARKIN occupied by a sulfate ion in PDB 4BM9 (ref. 14) (right). 
d, Fluorescence polarization experiments characterizing the binding of FIAsH- 
tagged phosphoUb to PARKIN variants. Measurements were performed in 
triplicate and error bars represent standard deviation from the mean. mP, milli- 
polarization unit. e, Binding assays as in d with full-length HsPARKIN and 
mutants in the phosphoUb binding site. Binding curves were compiled from 
experiments shown in Extended Data Fig. 3d. Measurements were performed 
in triplicate and error bars represent standard deviation from the mean. 
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interacts with side chains of the UPD (Fig. 2a and Extended Data 
Fig. 3). Key interactions are formed (1) via the phosphate group, 
which is located in a pocket formed by His304, Arg307 and Tyr314 
of PhPARKIN (Fig. 2c); (2) via the hydrophobic Ile44 patch of 
phosphoUb, which binds to an extended helix in the RING1 domain 
(amino acids 311-329, hereafter referred to as phosphoUb binding 
helix (pUBH)); (3) via a conserved surface B-hairpin loop (amino acids 
280-288) in RING1 that harbours AR-JP mutations; and (4) via 
the phosphoUb carboxy terminus, which forms an intermolecular 
parallel B-sheet with a B-strand of the IBR domain (Figs 1c and 2a). 
Most residues forming phosphoUb interactions in PhPARKIN are 
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f, Activity assays of full-length HsPARKIN variants with and without 
phosphoUb. After 2 h, reactions were resolved by SDS-PAGE and polyubiquitin 
visualized by anti-polyubiquitin western blotting (FK2, Millipore). PARKIN 
protein normalization is shown in Extended Data Fig. 3e. The experiment was 
performed three times with consistent results. g, YFP-HsPARKIN wild type 
or mutants were transfected into HeLa cells, treated with CCCP (10 uM) for 
1h and visualized by immunofluorescence. See Extended Data Fig. 4 for 
controls and quantification. Scale bars, 10 1m. h, HeLa cell lysates expressing 
YFP-HsPARKIN wild type or mutants were western blotted for PARKIN (after 
immunoprecipitation (IP)) and Tom20 (in whole-cell lysate (WCL)). NS, non- 
specific band. The experiment was performed at least twice as biological 
replicate for every mutant with consistent results. See Extended Data Fig. 4d and 
Supplementary Information. Molecular weight markers are in kDa for b, f, h. 
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conserved in HsPARKIN (Extended Data Fig. 3a—c), and in our pre- 
vious HsPARKIN structure’* the phosphate pocket is occupied by a 
sulfate molecule from the crystallization condition (Fig. 2c). 

A fluorescence-polarization-based phosphoUb binding assay 
revealed sub-micromolar interactions of phosphoUb with full-length 
HsPARKIN, HsPARKIN(AUDI) and PhPARKIN (Fig. 2d). Modifi- 
cation with phosphoUb suicide probes of PhAPARKIN or HsPARKIN 
Q347C (HsPARKIN Q347C-pUb) abrogated phosphoUb binding 
(Fig. 2d), indicating that the covalently bound phosphoUb molecule 
satisfied the major phosphoUb binding site. 

Mutations in the predicted phosphoUb interface in HsPARKIN 
reduced or abrogated phosphoUb binding (Fig. 2e and Extended Data 
Fig. 3d). HsPARKIN K151E (in the phosphate pocket; Fig. 2c, Ala152 in 
PhPARKIN), A320R (pUBH, Thr322 in PhPARKIN, Fig. 2a) or G284R 
(B-hairpin, Gly286 in PhPARKIN, Fig. 2a) abrogated phosphoUb bind- 
ing (Fig. 2e). H7PARKIN G284R is an AR-JP-derived patient mutation, 
and our data provide a rationale for how this mutation leads to defects 
in PARKIN function (see below). Similarly, AR-JP mutation L283P 
(ref. 20) and cancer-associated H279P (ref. 21) in this region might also 
disrupt this loop and affect phosphoUb binding to HsPARKIN. 

PARKIN activity can be assessed in autoubiquitination assays 
HsPARKIN was activated by phosphoUb, whereas HsPARKIN K151E, 
H302A, A320R or G284R showed impaired phosphoUb-induced 
activation (Fig. 2f). HsPARKIN K161N (an AR-JP mutation on the 
UPD", see below) and HsPARKIN G319A (see below) bound to and 
were activated by phosphoUb (Fig. 2e, f and Extended Data Fig. 3d, e). 

In HeLa cells, carbonyl cyanide m-chlorophenyl hydrazone 
(CCCP)-mediated depolarization of mitochondria led to rapid mito- 
chondrial localization of yellow fluorescent protein (YFP)-tagged 
HsPARKIN, while phosphoUb-binding mutants did not show 
mitochondrial localization (Fig. 2g and Extended Data Fig. 4a-c). 
Moreover, wild-type HsPARKIN ubiquitinated endogenous Tom20 
after CCCP treatment, while phosphoUb-binding mutants showed 
no apparent activity (Fig. 2h and Extended Data Fig. 4d). 
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Hence, we reveal that the phosphoUb binding site in PARKIN, 
which is conserved in divergent species, harbours AR-JP patient muta- 
tions and is important for PARKIN localization in cells (Fig. 2). This 
provides the molecular basis for PARKIN translocation to mitochon- 
drial phosphoUb”""”” (Extended Data Fig. 3g). 

Next we addressed the question of how phosphoUb activates 
PARKIN. PhosphoUb binds to a straight helix, pUBH, in PhPARKIN 
(Figs 2 and 3). In previous PARKIN structures’*”* this helix is kinked at 
Gly319 (Ala321 in PhPARKIN). The distinct conformation of the 
pUBH does not originate from this sequence difference; a crystal struc- 
ture of HsPARKIN(AUbl) G319A still shows a kinked pUBH con- 
formation (Extended Data Figs le and 5) and HsPARKIN wild type 
and G319A have similar biochemical properties (Fig. 2e, fand Extended 
Data Fig. 3d, e). Notably, RING] of the RBR E3 ligase HHARI (ref. 23) 
also features a kinked helix in the autoinhibited state (see Extended Data 
Fig. 5e-h). A kinked helix would be unable to bind phosphoUb, leading 
to a model in which the pUBH is dynamic and straightens upon 
phosphoUb binding. pUBH straightening hardly affects RINGI 
(Extended Data Fig. 6a) but impacts on the position of the IBR domain 
(Figs 2 and 3a, b), which rotates and moves by >20 Aas compared to 
full-length rat PARKIN (RnPARKIN)"® (Fig. 3a). The conformational 
change stretches the IBR-REP linker (14-15 amino acids, Extended 
Data Fig. 2a) from spanning 31 A in RnPARKIN to cover a distance 
of 43 A in PhPARKIN-pUb (Fig. 3b and Extended Data Fig. 6b). This 
appears to destabilize inhibitory interactions of REP and RING2 as 
suggested by increased B-factors for these domains in PhPARKIN- 
pUb (Extended Data Fig. 6c, d). 

More importantly, phosphoUb binding also destabilizes the inter- 
face between the PARKIN Ubl domain and the RBR core, due to 
displacement of the IBR domain and reorganization of the IBR-REP 
linker that no longer spans the Ubl surface (Fig. 3b). Using isothermal 
titration calorimetry (ITC), we detected binding of isolated HsPARKIN 
Ubl (amino acids 1-72) to HsPARKIN(AUDI) in trans (Fig. 3c, dissoci- 
ation constant (Kg) ~40 uM). Notably, binding is undetectable in the 


Figure 3 | Conformational changes due to 
phosphoUb binding. a, Straight pUBH in 
PhPARKIN-pUb (left) versus kinked pUBH in 
previous PARKIN structures (middle; shown is 
full-length RnPARKIN, PDB 4K95 (ref. 16)). 
Superposition indicating IBR repositioning 

by >20A (right). b, Full-length RnPARKIN (top) 
with core domains under a transparent surface and 
Ubl as green cartoon, and PhPARKIN-pUb 
(bottom), with a green circle indicating the putative 
UbIl binding site; phosphoUb is in orange. c, ITC 
experiment titrating HsPARKIN Ub! domain into 
HsPARKIN(AUDI) without phosphoUb (blue 
curve, Ky ~39 1M), with phosphoUb present 
(black curve, Kg not detectable (n.d.)), and 
HsPARKIN K151E with phosphoUb present (red 
curve, Ky ~ 69 1M). Heat release curves (top panel) 
were shifted in their y-axis position for better 
visibility. Dissociation constant and curve fitting 
errors are indicated. All measurements were 
performed three times with consistent results. 

d, Activity of full-length HsPARKIN, 
HsPARKIN(AUbI) (137-465), HsPARKIN(AUbI2) 
(80-465) and SUMO-tagged HsPARKIN(AUbI2) 
(80-465) with or without phosphoUb for 1h (see 
Fig. 2f and Extended Data Fig. 3f). Experiments were 
performed three times with consistent results. 
Molecular weight markers are in kDa. 
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presence of phosphoUb, but recovered with phosphoUb-binding-defi- 
cient HsPARKIN(AUbDI) K151E mutant (Fig. 3c). Hence, phosphoUb 
binding releases the Ubl from the PARKIN RBR core. 

PARKIN variants lacking the Ubl domain are still autoinhibited’** 
and less well activated by phosphoUb in comparison to full-length 
PARKIN’ (Fig. 3d). This indicates that the presence of the Ubl domain 
is important for full PARKIN activity. Interestingly, replacing the Ubl 
domain (amino acids 1-79) with SUMO, which lacks a Ubl-like hydro- 
phobic patch and would not bind RINGI, activates PARKIN constitu- 
tively, even in the absence of phosphoUb. This suggests that the released 
Ubl domain actively helps to unravel the autoinhibited PARKIN confor- 
mation. This, together with destabilization of the REP and RING2 auto- 
inhibitory interactions, enables RING1 to bind and discharge E2-Ub 
conjugates’, and explains how PARKIN is activated by phosphoUb. 

An alternative mechanism to activate PARKIN is PINK1-mediated 
phosphorylation of Ser65 in the Ubl domain”?”®, which was also sug- 
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gested to release the Ubl domain from the PARKIN core*™. In the 
closed conformation of PARKIN, the Ubl domain binds via its Ile44 
patch to RING (refs 13, 16) (Fig. 4a and Extended Data Fig. 1). 
Importantly, PINK1 did not phosphorylate HsPARKIN Ubl 144A 
(or ubiquitin 144A) efficiently (Extended Data Fig. 7a, b). This suggests 
that PARKIN and PINK] utilize overlapping binding sites on the Ubl 
domain, and that the Ubl domain has to be released from the PARKIN 
core for PINK] to access and phosphorylate it. 

Consistently, using a phosphospecific antibody against PARKIN 
phospho-Ser65 (anti-pSer65 PARKIN), we found that PARKIN phos- 
phorylation is significantly enhanced when phosphoUb is added to 
the reaction, and this depends on ubiquitin phosphorylation and 
phosphoUb binding (Fig. 4c and Extended Data Fig. 7c, d). Induced 
release of the Ubl domain by mutating the binding site on RING] 
(HsPARKIN L266R, Fig. 4a) leads to phosphorylation by PINK] in 
the absence of phosphoUb (Fig. 4c and Extended Data Fig. 7c, d), 
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‘Closed’ inactive PARKIN Dynamic intermediates 


Figure 4 | PhosphoUb induces PARKIN UbI phosphorylation and model 
of PARKIN activation. a, Detail of the Ubl position on PARKIN. Ubl Ile44 
and interacting Leu266 on RING] are highlighted. The mostly disordered IBR- 
REP linker hovers above Ser65 and moves out of the way in PhPARKIN-pUb 
(compare Fig. 3b). b, Time course of phosphorylation of HsPARKIN by 
GST-TcPINK1 (Tribolium castaneum PINK1), in the absence and presence 
of phosphoUb at 37 °C, visualized with anti-pSer65 PARKIN antibody (Abcam 
catalogue number ab154995) (top) and Coomassie (bottom). The experiment 
was performed three times with consistent results. c, Experiment as in b but 
performed at 22 °C with lower phosphoUb concentration (5.8 1M). The 
experiment was performed three times with consistent results. Also see 
Extended Data Fig. 7c, d. d, Model of PARKIN activation. PARKIN 


PhosphoUb-induced UbI release 
and interface destabilization 


‘Open’ active PARKIN 


is autoinhibited (top left) by multiple mechanisms (Extended Data Fig. 1). 
Inactive PARKIN can interact with phosphoUb, which releases the Ubl 
domain, destabilizes inhibitory interactions and ‘opens’ PARKIN (top row). 
This process is reversible. PARKIN can also be phosphorylated by PINK1, 
either directly (bottom left) or with improved kinetics after phosphoUb binding 
(bottom middle). Phosphorylated Ubl undergoes a conformational change 
(Extended Data Fig. 8), probably preventing it from reverting back to the 
autoinhibited state. Phosphorylation of PARKIN hence stabilizes an open, 
active conformation (bottom right). Thickness of arrows indicates preferred 
routes. Further information on the role of Ubl phosphorylation is in Extended 
Data Figs 9 and 10. Molecular weight markers are in kDa for b and c. 
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indicating that Ubl release is crucial for enhanced phosphorylation. 
These results reveal a new function for phosphoUb, namely to enable 
phosphorylation of the Ubl domain (Fig. 4). However, PINK1 can 
phosphorylate PARKIN in the absence of phosphoUb in vitro (albeit 
inefficiently, Fig. 4b) and in cells’”*, showing that PARKIN is a 
dynamic molecule (Fig. 4d) in which the Ubl domain is partially 
accessible by PINK]. 

We next examined consequences of Ubl phosphorylation in 
HsPARKIN. Nuclear magnetic resonance measurements reveal that 
the Ubl domain undergoes significant changes when phosphorylated, 
in particular in the Ser65 loop and the Ile44 patch (Extended Data 
Fig. 8). Disruption of the Ile44 patch prevents re-binding of the Ubl 
domain to RING1"*”, and probably stabilizes a more open, active 
conformation of PARKIN. Moreover, this may also explain why the 
phosphorylated Ubl domain cannot compete with phosphoUb for the 
phosphoUb binding site (Extended Data Fig. 9). Phosphorylated 
HsPARKIN does not impede binding of phosphoUb, but rather 
enhances it’ (Extended Data Fig. 9). 

Together, this completes our model of PARKIN activation (Fig. 4d). 
Autoinhibited PARKIN (Fig. 4d, top left) responds to phosphoUb on 
mitochondria, which releases the Ubl to activate PARKIN in a revers- 
ible manner (Fig. 4d, top row). PINK1 phosphorylates PARKIN pref- 
erentially when the Ubl domain is released (Fig. 4d, bottom middle), 
and this leads to irreversible PARKIN activation (Fig. 4d, bottom 
right). Alternatively, inactive PARKIN may be phosphorylated by 
PINK] directly’”* (Fig. 4d, bottom left); this improves phosphoUb 
binding, retains PARKIN on mitochondria, and irreversibly activates 
PARKIN (Fig. 4d, bottom row). 

The conformation of fully active PARKIN remains elusive. Phos- 
phorylated PARKIN but not phosphoUb-activated PARKIN exposes 
its active site Cys residue’ (Extended Data Fig. 10), further indicating 
that PARKIN phosphorylation leads to ‘opening’ of PARKIN. Notably, 
mutations in a putative second phosphate pocket in the UPD, which we 
reported previously* and which is distinct from the pocket involved in 
phosphoUb binding, prevents phospho-Ubl-induced PARKIN open- 
ing and activation (Extended Data Fig. 10c, d). The functional link 
between PARKIN phosphorylation and a putative phosphate-binding 
pocket in the UPD may suggest that the phosphorylated Ubl domain 
binds back to the UPD; however, alternative activation mechanisms or 
the involvement of the Ubl-UPD linker cannot be excluded. 

Our work is consistent with suggested models of PARKIN-mediated 
mitophagy’*>** and provides a structural understanding for phosphoUb 
binding and allosteric PARKIN activation. We refine the role of the 
PARKIN Ubl domain as an essential activating element that is restrained 
in autoinhibited PARKIN. The model ensures tight temporal and spatial 
regulation of PARKIN activity, and incorporates a commitment step 
whereby PARKIN phosphorylation locks PARKIN in the active, open 
conformation. Our insights may prove useful pharmacologically, 
since small molecules that dislodge the PARKIN Ubl domain from 
the PARKIN core may activate PARKIN and benefit patients with 
Parkinson’s disease. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

No statistical methods were used to predetermine sample size. 

Protein expression and purification. HsPARKIN full length, Hs7PARKIN(AUbI) 
(amino acids 137-465), PhPARKIN (amino acids 140-461), PhPINK1 (amino 
acids 115-575) as well as Tribolium castaneum PINK1 (TcPINK1, amino acids 
128-570) were expressed as GST fusion proteins in Rosetta2 pLacl cells from 
pOPIN-K vectors as described’. In short, PARKIN cultures were induced by 
adding 200}.M ZnCl, and 50M IPTG, whereas PINK1 variants and the 
HsPARKIN Ubl domain (amino acids 1-72) were induced with 150 uM IPTG 
followed by 12h expression at 18 °C. After collection, the cell pellet was lysed by 
sonication in lysis buffer (270mM sucrose, 10mM glycerol 2-phosphate di- 
sodium, 50 mM NaF, 14mM f-mercaptoethanol, 50 mM Tris (pH 8.0)), in the 
presence of lysozyme, DNase I and EDTA-free protease inhibitors. The suspen- 
sion was centrifuged and the supernatant applied to Glutathione Sepharose 4B 
beads (GE Healthcare). After 1h of agitation at 4 °C the beads were washed with 
high salt buffer (500 mM NaCl, 10mM DTT, 25 mM Tris (pH 8.5)) and equili- 
brated in low salt buffer (200 mM NaCl, 10 mM DTT, 25 mM Tris (pH 8.5)). GST- 
tagged proteins were either eluted from beads with low salt buffer containing 
40mM glutathione and purified by gel filtration (Superdex 200, GE Life 
Sciences), or cleaved on beads by incubating with GST-3C protease for 12h at 
4°C before further purification by gel filtration (Superdex 75, GE Life Sciences) in 
low salt buffer as a final step. When improved purity was required, such as for 
PhPARKIN used in crystallization, an anion exchange step (RESOURCE Q, GE 
Life Sciences) using a linear gradient of 75-600 mM NaCl in 10 mM DTT, 25 mM 
Tris (pH 8.5) was included before final gel filtration. 

N-terminally His6-SUMO-tagged HsPARKIN(AUDI2) (amino acids 80-465) 
was expressed as described above and lysed in His6-lysis buffer (200-300 mM 
NaCl, 4mM f-mercaptoethanol, 25mM Tris (pH 8.5)). After centrifugation, 
the protein was affinity purified with Talon Superflow resin (GE Healthcare) 
and eluted in 200mM NaCl, 4mM f-mercaptoethanol, 25mM Tris (pH 8.5) 
containing 200-250 mM imidazole. Eluted protein was directly applied to gel 
filtration (Superdex 75, GE Life Sciences) in low salt buffer. Protein for SUMO- 
tag cleavage was dialysed overnight in 200 mM NaCl, 4mM f-mercaptoethanol, 
25 mM Tris (pH 8.5) containing His6-tagged SENP 1. The sample was reapplied to 
Talon resin, and the flow-through purified by gel filtration (Superdex 75, GE Life 
Sciences) in low salt buffer. 

Phospho-HsPARKIN for biochemical assays was generated by incubating 
32-37 UM HsPARKIN, 5.4 UM GST-PhPINK1 and 10 mM ATP with 1X ligation 
buffer (40 mM Tris pH 7.4, 10 mM MgCl, 0.6 mM DTT) for 1h at room temper- 
ature. GST-PhPINK1 was removed with Glutathione Sepharose 4B resin (GE 
Healthcare) and HsPARKIN was re-purified by gel filtration (Superdex 75, GE 
Life Sciences). Consistent phosphorylation levels were checked by western blot 
analysis using an anti-pSer65 PARKIN antibody (Abcam cat no. ab154995). 
Modification with ubiquitin-based suicide probes. Probe reactions for bio- 
chemical assays were performed by incubating 5 4M PARKIN with 40 uM indi- 
cated ubiquitin suicide probe (Figs 1b and 2b) or 20 4M Ub-VS (Extended Data 
Fig. 10), 1 pM PhPINK1] where indicated and 5 uM phosphoUb where indicated in 
the presence of 1 X reaction buffer (50mM Tris pH 7.4, 200mM NaCl, 10 mM 
DTT, 10mM MgCl,). The reaction took place at room temperature after adding 
10 mM ATP and was quenched by adding LDS sample buffer at the indicated time 
points. Samples were applied on NuPAGE 4-12% Bis-Tris gels (Invitrogen) and 
stained with Instant Blue SafeStain (Expedeon). 

To generate the covalent PhPARKIN-pUb complex for crystallization, 46 1M 

PhPARKIN (amino acids 140-461) was reacted with 230 1M UbC3Br probe and 
23 uM GST-PhPINK1 in the presence of 1 X reaction buffer. The coupling was 
initiated by adding 10 mM ATP and incubated for 6h at room temperature. The 
complex was purified by gel filtration (Superdex 75, GE Life Sciences) in low salt 
buffer. Fractions containing PhPARKIN-pUb were pooled, concentrated and 
used for crystallization without freezing. 
Crystallization, data collection and refinement. PhPARKIN-pUb was crystal- 
lized in sitting-drop vapour diffusion at a concentration of ~5.3 mg ml‘ at 18 °C. 
Crystals were grown in 100 nl protein solution mixed with 100 nl mother liquor 
(2% (v/v) PEG400, 2 M ammonium sulfate, 0.1 M HEPES pH 7.5). Before vit- 
rification, crystals were soaked in 1.7% (v/v) PEG400, 15% (v/v) glycerol, 1.7 M 
ammonium sulfate, 0.085 M HEPES (pH 7.5) for cryo-protection. Diffraction data 
were collected at the Diamond Light Source (Harwell, UK), beamline I-04, at 100 K 
and a wavelength of 0.97949 A, to a resolution of 2.62 A. 

The HsPARKIN(AUDI) (amino acids 137-465) G319A mutant was crystallized 
at a concentration of 2.4mg ml’ by mixing 400 nl protein solution with 400 nl 
mother liquor (1.8 M lithium sulfate, 0.01 M MgCl, 0.05M MES pH 5.6) ina 
sitting-drop vapour diffusion set-up at 18 °C. Before vitrification in liquid nitrogen 
the crystals were briefly soaked in 1.6 M lithium sulfate, 0.01 M MgCh, 0.05 M 
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MES pH 5.4 containing 15% (w/v) glycerol. Diffraction data were collected at the 
Diamond Light Source (Harwell, UK), beamline 104-1, at 100 K and a wavelength 
of 0.91730 A, to a resolution of 2.35 A. 

Phasing of the PhPARKIN-pUb data set was performed by molecular replace- 
ment with Phaser” using isolated domains of HsPARKIN(AUbI) (PDB 4BM9 
(ref. 14)) and ubiquitin (PDB 1UBgq (ref. 30)) as search models. The structure of 
HsPARKIN(AUDI) G319A was solved by using HsPARKIN(AUbI) (PDB 4BM9 (ref. 
14)) as a refinement model. For both structures, subsequent rounds of model building 
in Coot*! and refinement in Phenix” resulted in final models with statistics shown in 
Extended Data Fig. le. The HsPARKIN(AUbI) G319A structure was refined with 
simulated annealing to reduce model bias. Both structures were refined with TLS, 
using different protein chains as individual TLS groups. Final Ramachandran stat- 
istics were 95.3%/4.7%/0.0% (favoured/allowed/outliers) for the H7PARKIN G319A 
mutant structure, and 96.7%/3.2%/0.1% for the PhPARKIN-pUb structure. 
Structure figures were generated with PyMol (http://www.pymol.org). 
Fluorescence polarization phosphoUb binding assays. N-terminally FIAsH- 
tagged ubiquitin was phosphorylated and purified as described for phosphoUb* 
with buffers supplemented with B-mercaptoethanol. Labelling was performed 
overnight at 4°C with 60 1M FIAsH-tagged phosphoUb at a ratio of 37.5:1 (v/v) 
with Lumio Green (Invitrogen) in 1 < FIAsH dilution buffer (50 mM Tris (pH 7.4), 
1mM f-mercaptoethanol). Buffer exchange was performed with PD-10 desalting 
columns (GE Healthcare) in 50 mM Tris (pH 7.4), 2mM [-mercaptoethanol and 
samples were concentrated to ~8 1M FIAsH-labelled phosphoUb. For binding 
studies in 384-well low volume plates (Corning), 10 pl of 100nm labelled 
phosphoUb was mixed with 10 ul of PARKIN serial dilutions in FIAsH buffer 
(20mM Tris (pH 7.4), 100mM NaCl, 2mM f-mercaptoethanol, 0.1 mg ml! 
bovine serum albumin). Fluorescence polarization (FP = (I|]| —11)/(Il +11)) 
was measured using a PheraStar plate reader (BMB Labtech) with the optic module 
set to 1... = 485 nm and /.., = 520 nm. Measurements were performed in triplicate 
and error bars are given as the standard deviation from the mean. A least square fit 
for one binding site was performed using the following equation 


FP = (Bmax X X/(Ka +X))+NS x X + background 


with FP being fluorescence polarization and X the concentration of the titrant, Byrax 
is the maximum specific binding, Kg is the equilibrium dissociation constant and 
NS the slope for nonspecific binding, which was restricted to values greater than 0. 
PARKIN activity assays. Spin-filtered HsPARKIN (2 |1M) was pre-incubated for 
0.5h at 30°C with 10mM ATP, 1 X ligation buffer (40 mM Tris pH 7.5, 10 mM 
magnesium chloride, 0.6mM dithiothreitol (DTT)), a total concentration of 
0.5 mg ml~? ubiquitin and 0.05 mg ml~' phosphoUb or 0.1 14M GST-TcPINK1 
where indicated. Ubiquitination was initiated by adding 0.1}1M El and 1M 
UBE2L3 (Fig. 2f and Extended Data Fig. 10d) or 0.2 1M E1 and 2 4M UBE2L3 
(Fig. 3d). The reaction was quenched with LDS sample buffer containing DTT and 
iodoacetamide to prevent forming of disulfide bridges. NuPAGE 4-12% gradient 
Bis-Tris gels were used for separation, and proteins were transferred on a nitro- 
cellulose membrane with subsequent detection using an anti-polyubiquitin FK2 
antibody (Millipore). 

Immunofluorescence. HeLa cells (originating from ATCC) were nucleofected 
with N-terminally tagged eYFP—HsPARKIN (gift from the J. Kittler laboratory) 
and grown on coverslips for 24-48h. After treatment with DMSO or CCCP 
(10 1M) for 1h, cells were fixed with 4% paraformaldehyde, washed with 0.1M 
glycine in phosphate buffered saline (PBS), at pH 7.4, briefly permeabilized with 
0.2% Triton X-100 in PBS and blocked with a blocking solution containing 10% 
goat serum and 0.5% BSA. Samples were further incubated with anti-Tom20 
antibody (FL-145, Santa Cruz) followed by goat Alexa647-coupled anti-rabbit 
antibody (Life Technologies). Confocal images were taken using a Zeiss 
LSM780 microscope. 

Immunoblotting and immunoprecipitation. HeLa cells (originating from 
ATCC) were transfected with eYFP—HsPARKIN. After 24-48 h, cells were treated 
with DMSO or CCCP as before, and lysed in cell lysis buffer (50 mM Tris (pH 7.4), 
150 mM NaCl, 1% (v/v) NP-40) supplemented with EDT A-free protease inhibitor 
cocktail (Roche), PhosphoSTOP (Roche), as well as 10 mM N-ethylmaleimide 
(Sigma-Aldrich) and 10 mM chloracetamide (Sigma-Aldrich) for better detection 
of ubiquitinated proteins. For immunoprecipitation of eY FP-HsPARKIN, 500 pg 
of cell lysate was incubated with GFP-Trap agarose (Chromotek) for 1h. The 
beads were washed three times with cell lysis buffer, and proteins were eluted with 
1X LDS buffer. Cells were regularly checked for the absence of mycoplasma 
infection using the MycoAlert Kit (Lonza). Antibodies were from commercial 
sources: goat anti-GFP (ab6673, Abcam), rabbit anti-Tom20 (FL-145, Santa 
Cruz), mouse anti-GAPDH (6C5, Ambion). 

Isothermal titration calorimetry. Isothermal titration calorimetry (ITC) experi- 
ments were performed in a MicroCal iTC200 machine (GE Healthcare) at 20 °C 
with the sample and the ligand in low salt buffer. The cell contained 35 1M of 
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HsPARKIN(AUDI) and 400 uM HsPARKIN UbI was injected in 2 il injections at 
120s intervals. Protein sample as well as ligand were in low salt buffer and 
HsPARKIN(AUDI) was mixed with phosphoUb at a 1:1.2 molar ratio as indicated. 
Binding curves were integrated and fitted to a one-site binding model by using the 
MicroCal ITC Analysis plug-in for Origin (Malvern). 

Phosphorylation assays. PARKIN phosphorylation was performed by incubating 
5 uM HsPARKIN with 0.5 1M GST-TcPINK1, 10mM ATP, 1 X reaction buffer 
and phosphoUb (14 UM unless stated differently). The reaction was quenched at 
the given time points with LDS sample buffer and proteins were separated on a 
NuPAGE 4-12% gradient Bis-Tris gel, transferred on nitrocellulose membrane 
and detected with anti-pSer65 PARKIN antibody (Abcam cat no. ab154995). 
Phosphorylation assays of HsPARKIN Ubl domain (amino acids 1-72) and 
ubiquitin were performed as described above with 2011.M of HsPARKIN Ubl 
domain and ubiquitin, respectively. For the ubiquitin phosphorylation assay, 
the GST-TcPINK1 concentration was increased to 1.5M. The reaction was 
quenched at the given time points with LDS sample buffer and proteins were 
separated on a 15% SuperSep Phos-tag gel (Wako Chemicals) and stained with 
Instant Blue SafeStain (Expedeon). 

Phosphorylation of the PARKIN Ubl domain for NMR analysis. Isotope- 
labelled HsPARKIN Ubl domain (amino acids 1-72) was expressed in M9 min- 
imal media supplemented with 4g 1”* '°C-glucose, 2g 1”’ ‘"N-NH,Cl, trace 
elements and BME vitamins (Sigma-Aldrich) and purified as described above. 
The final gel filtration was performed in NMR buffer (18 mM Na,HPO,, 7 mM 
NaH,PO, (pH 7.2), 150mM NaCl, 5mM TCEP). Isotope-labelled HsPARKIN 
Ubl was phosphorylated at room temperature by incubating 80 WM HsPARKIN 
Ubl with 2.5 uM PhPINK1, 1 mM ATP and 1 * ligation buffer which was adjusted 
to 332.5 tl with NMR buffer, before addition of 17.5 tl DjO as lock solvent. The 
reaction was monitored by consecutive HN 2D BEST-TROSY (band selective 
excitation short transients transverse relaxation optimized spectroscopy) experi- 
ments and quenched with apyrase. 

Solution studies of the phosphorylated PARKIN UbI domain. NMR acquisi- 
tion was performed at 298 K on a Bruker Avance III 600 MHz spectrometer 


equipped with a cryogenic triple resonance TCI probe. The software packages 
Topspin3.2 (Bruker) and Sparky (Goddard & Kneller, UCSF; http://www.cgl.ucsf. 
edu/home/sparky/) were used for data processing and analysis, respectively. 
*H,'°N 2D BEST-TROSY experiments®? were conducted with in-house optimized 
Bruker pulse sequences that contained a recycling delay of 400 ms and 512 x 64 
complex points in the 'H,'°N dimension, respectively. 

Standard HSQC-based Bruker triple resonance pulse sequences were used to 
generate backbone chemical shift assignments. CBCACONH and HNCACB spec- 
tra were collected with 50% non-uniform sampling (NUS) of 1,024 x 32 x 55 
complex points in the 'H, '°N and °C dimensions. HNCO and HNCACO experi- 
ments were acquired using NUS at a rate of 50% with 1,024 x 32 X 48 complex 
points in the 'H, '°N and °C dimensions, respectively. Data set processing was 
performed with compressed sensing using the MddNMR software package™*. 
Weighted chemical shift perturbation calculations were completed using the equa- 
tion \/((A*H)? + (A?°N/5)’). 
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RnPARKIN (4k95) REP blocks Ubi bios 
E2~Ub binding E2 binding 
Disordered loop UBIUPD 
Disordered linker - de 
REP-RING2 ~~ 
RING2 e' 
catalytic Cys 
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b 
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a ee ee ee GST-PHPINK1 
49 
38 
28 4 
17 
14 
6 4 phospho-UbC3Br 
3 UbC3Br 
~~ 
Coomassie / Phos-tag 
e 
Data collection statistics. Numbers in brackets are for the highest resolution bin. 
PhPARKIN~phosphoUb HsPARKIN G319A 
Data collection 
Space group P2, H3, 
a, b, c (A) 49.17, 44.12, 204.53 169.36, 169.36, 96.99 
a, B, y (°) 90, 92.55, 90 90, 90, 120 
Resolution (A) 34.06 - 2.62 (2.74 - 2.62) 86.68 - 2.35 (2.43 - 2.35) 
Rnevae 0.085 (0.389) 0.089 (0.802) 
<//ol> 6.3 (2.0) 13.3 (2.1) 
Completeness (%) 99.6 (99.6) 100 (100) 
Redundancy 3.0 (3.1) 6.9 (6.8) 
Refinement 
Resolution (A) 34.06 — 2.62 84.68 — 2.35 
No. reflections (test set) 26558 (1330) 22270 (1148) 
Ryo / Riree 0.228 / 0.260 0.198 / 0.229 
No. atoms 
Protein 5569 (765 aa) 2357 (306 aa) 
Ligand/ion 25 48 
Water 33 81 
B-factors 
Protein 70.2 54.3 
Ligand/ion 115.1 88.9 
Water 50.3 47.1 
R.m.s deviations 
Bond lengths (A) 0.005 0.003 
Bond angles (°) 0.620 0.691 


Datasets were collected and structures determined from a single crystal each. 
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Extended Data Figure 1 | Autoinhibited PARKIN and phosphoUb probes. 
a, Structure of RniPARKIN (PDB 4K95 (ref. 16), chain A is used for all 
representations of RiPARKIN) with Ubl domain coloured in green, the unique 
PARKIN domain (UPD) in dark blue, RING] in blue, IBR in light blue, REP 
in red, and RING? in cyan. Zinc atoms are shown as grey spheres. The catalytic 
Cys431 is shown in ball-and-stick representation. Disordered linkers are 
indicated as dotted lines. The three mechanisms of PARKIN inhibition 
indicated in red. b, Schematic diagram of the closed, autoinhibited confor- 
mation of full-length PARKIN. c, Superposition of the E2-Ub complex from 
the BIRC7 structure (PDB 4AUQ (ref. 35)), superposed via its RING domain 
onto RING of full-length RnPARKIN (PDB 4K95 (ref. 16)) to indicate 


the position of the E2-Ub on PARKIN RINGI. Assuming that E2-Ub adopts a 
canonical conformation on RINGI, the E2 would clash with the Ubl domain 
and partially with the REP, while the E2-linked ubiquitin would clash with 
the REP. Hence, Ubl and REP have to be released to enable PARKIN E2-Ub 
binding at RINGI. d, Time-course analysis of an exemplary reaction of UbC3Br 
(0.2 mg ml *) phosphorylated by GST-PhPINK1 (5 uM) as described 
previously for ubiquitin’. Phosphorylation of proteins was monitored by a band 
shift on Coomassie-stained Phos-Tag gels. The experiment was performed two 
times with consistent results. Molecular weight markers are in kDa. e, Data 
collection and refinement statistics. 
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Extended Data Figure 2 | PhPARKIN similarity with human and rat 
PARKIN and map quality. a, Structure-based sequence alignment of 
HsPARKIN from human (top), RnPARKIN (middle) and Pediculus humanus 
corporis (PhPARKIN, bottom). The domains are indicated in boxes coloured 
according to structural figures. HsPARKIN and PhPARKIN are 45% 
identical within their crystallized constructs. Secondary structure elements are 
shown for HsPARKIN Ubl domain (PDB IIYF (ref. 36)) and for HSPARKIN 
core domain (4BM9 (ref. 14)). Red asterisks denote the phospho-Ser65 
ubiquitin binding pocket, and yellow spheres the residues contacting 
phosphoUb (see Fig. 2). A black asterisk denotes the catalytic Cys in RING2. 
b, Stereo representation of the asymmetric unit of PAPARKIN-pUb crystals, 


showing 2|F,| — |F.| electron density at 1o, in blue for PhPARKIN and in 
green for phosphoUb. c, Electron density detail, shown as in b, zooming in on 
phosphoUb phospho-Ser65, in stereo representation. d, Superposition of the 
two PhPARKIN-pUb complexes in the asymmetric unit, coloured as in Fig. 1. 
The r.m.s.d. is 0.76 A. Electron density is missing for parts of the flexible linker 
between IBR and RING2. e, Superposition of available PARKIN structures 
(PDB 4BM9 (ref. 14); 411H (ref. 15); 4K95 (ref. 16) and two PhPARKIN-pUb 
complexes) in different colours, showing similar domain positions with respect 
to each other, with the exception of the IBR domain. Only the structure of full- 
length RnPARKIN" contains the Ubl domain. 
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Extended Data Figure 3 | Conservation of phosphoUb interacting residues 
and biochemical analysis. a, b, An alignment of PARKIN from species 
available in Ensembl (http://www.ensembl.org) was curated by removing 
sequences with truncations or poorly sequenced regions. The residues involved 
in phosphoUb binding, comprising (a) the phosphate pocket and (b) the 
pUBH, are shown, and residues contacting phosphoUb are highlighted. 

c, Comparison of residues from PhPARKIN and HsPARKIN involved in 
phosphoUb binding. Highlighted in green is the HsSPARKIN -hairpin residue 
Gly284 (Gly286 in PhPARKIN), which is mutated to Arg in AR-JP. Other 
B-hairpin patient mutations, L283P (Phe285 in PhPARKIN) and H279P 
(Ser281 in PhPARKIN), introduce Pro residues, probably distorting the B- 
hairpin loop. The notation (bb) indicates backbone interaction. d, Fluorescence 
polarization assays performed with mutant PARKIN and phosphoUb. Assays 
from two independent experiments were combined to produce Fig. 2e. 
Measurements were performed in triplicate with error bars given as standard 


deviation from the mean. e, Coomassie-stained SDS-PAGE gel showing 
HsPARKIN proteins used in activity assays. f, Normalized proteins for 
PARKIN activity assays in Fig. 3d. g, Analogous to Fig. 2a, phosphoUb bound 
to PhAPARKIN is shown with Lys residues in stick representation, and free 
amine groups as blue spheres. Several Lys side chains (Lys29, Lys33, Lys48 and 
Lys63) were disordered in the electron density maps, suggesting high flexibility 
and solvent accessibility, and were modelled in their preferred side-chain 
rotamer for illustrative purposes. When bound to PARKIN, each Lys residue 
can be ubiquitinated, and the C terminus which is covalently attached to 
PhPARKIN in the complex structure is probably more flexible and could also be 
attached to a more proximal (phospho) Ub in a chain. This indicates that 
PARKIN could interact with phosphoUb-containing polyubiquitin chains that 
were reported to be the PINK1 substrate on mitochondria”*"’. Molecular 
weight markers are in kDa in e and f. 
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Extended Data Figure 4 | PARKIN phosphoUb-binding mutants do not 
translocate to mitochondria. a, b, HeLa cells that do not contain detectable 
levels of PARKIN were transiently transfected with YFP-HsPARKIN wild type 
and indicated mutants (green). Tom20 staining with anti-Tom20 antibody 
indicates mitochondria (red). Images are representative of three biological 
replicates. a, In DMSO-treated control cells, PARKIN does not co-localize with 
mitochondrial Tom20. b, After treatment with CCCP (10 1M) for 1h, wild- 
type PARKIN but not phosphoUb-binding mutants co-localize with Tom20. 
Scale bars, 10 tum. Split channels are shown with overlay to illustrate 
co-localization. The YFP channel is identical to Fig. 2g. Digital zoom 10. 

c, Quantification of cells with PARKIN localized at mitochondria in b, scored 
for 150 cells per condition in three biological replicates. PhosphoUb binding 
mutants do not show sustained mitochondrial localization. Error bars represent 


standard error of the mean. d, HeLa cells transiently transfected with 
HsPARKIN wild type and mutants were treated with CCCP and YFP- 
HsPARKIN immunoprecipitated (see Methods). Whole-cell lysates (WCL) or 
immunoprecipitates (IP) were western blotted for PARKIN, Tom20 and 
GAPDH (loading control) as indicated. Ubiquitinated forms of PARKIN and 
Tom20 can be observed with wild-type PARKIN but not with PARKIN 
mutants. See also Fig. 2h where the A320R mutant was also included. These 
experiments were performed three times as biological replicates with similar 
results. Note that PARKIN autoubiquitination varied and was weaker in some 
experiments, while Tom20 ubiquitination was more robust. Molecular weight 
markers are in kDa. See Supplementary Information for uncropped blots. 
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Extended Data Figure 5 | The kink in the pUBH. a-d, To understand 
whether the straight pUBH in PhPARKIN was a consequence of a helix- 
favouring mutation of Gly319 (HsPARKIN) to Ala (Ala321 in PhPARKIN), the 
HsPARKIN(AUbI) G319A mutant was crystallized and a structure determined 
at 2.35 A resolution (see Extended Data Fig. le and Methods). a, Superposition 
of HsPARKIN(AUbl) G319A and wild-type HsPARKIN(AUbI) (PDB 4BM9 
(ref. 14)). The structures are virtually identical, both containing a kinked helix. 
b, c, Electron density detail of the kinked helix for the mutant (b) and wild 
type (c), shown in stereo representation, with 2|F,| — |F.| density contoured at 
lo. The CB atom of Ala319 is clearly defined in electron density and does not 
induce helix straightening in this crystallographic setting. d, Comparison of 
pUBH helices in HsPARKIN(AUbI) G319A (left), HsPARKIN(AUDI) wild type 
(second from left), superposition of the two (second from right), and for 
comparison the straight pUBH in PhPARKIN-pUb. e, f, Structure of the RBR 


E3 ligase HHARI (e) in an autoinhibited form (PDB 4KBL (ref. 23)) showing an 
entirely different RBR module as compared to full-length RnPARKIN (PDB 
4K95 (ref. 16)) in f. RING1, IBR, RING2 domains are coloured as for PARKIN 
in Fig. 1, and other domains (UBA-like domain and Ariadne domain in 
HHARI) are coloured in red. Interestingly, the pUBH equivalent helix in 
HHARI is kinked at a similar position as compared to mammalian PARKIN. 
g, Superposition of structures from e and f on their RING1 domains. In 
HHARI, the pUBH-equivalent helix kinks in a different manner as compared 
to HsPARKIN. h, In HHARI, the kinked helix seems to be stabilized by an 
interaction in cis with the UBA-like domain that binds NEDD8 (ref. 37). The 
sequence of the helix does not contain Gly residues, but is kinked at a Thr residue 
(Thr263). It will be interesting to see whether helix straightening occurs in active 
forms of HHARI that can be induced by binding NEDD8-modified cullins’’. 
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Extended Data Figure 6 | Structural detail and B-factor analysis. 

a, PhosphoUb-induced pUBH straightening is energetically neutral as the two 
hydrogen bonds between helix residues and the RINGI core are not lost but 
only adjusted. Left: full-length RnPARKIN structure (PDB 4K95 (ref. 16)). 
Right: PhAPARKIN-pUb structure. b, Structure of full-length RnPARKIN (PDB 
4K95 (ref. 16), left), PhPARKIN-pUb (middle), and a superposition of the two 
(right), in which Co atoms of structurally identical, ordered residues of the 
IBR-REP linker are shown as spheres. The distance between these residues is 
indicated by a dotted line. The linker sequence is highlighted in Extended 
Data Fig. 2. c, Structures of PARKIN in which B-factors were refined for 
individual atoms (PDB 4BM9 (ref. 14); 411H (ref. 15)) are shown in a ribbon 


representation, in which the ribbon thickness indicates B-factors differences, 
with thin ribbons indicating low and thick ribbon indicating high B-factors. 
The full-length RnPARKIN structure (PDB 4K95 (ref. 16)) is not included as in 
this 6.5 A structure overall B-factors were assigned. d, Structures of 
PhPARKIN-pUb are shown as in c for each molecule of the asymmetric unit. 
The REP and RING2 elements are destabilized as indicated by higher B-factors. 
The rigid core of the protein has shifted from the UPD-RING1-RING2 
interfaces (compare c) to the UPD-RING1-IBR-phosphoUb interfaces. 
B-factor analysis may be distorted by neighbouring molecules and 

crystal contacts, which are not indicated here. 
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Extended Data Figure 7 | The Ile44 patch is essential for PINK1-mediated 
phosphorylation of Ub and PARKIN Ubl. All assays were performed three 
times with consistent results. a, b, Coomassie-stained PhosTag gels comparing 
the phosphorylation of (a) the HsPARKIN Ubl domain (amino acids 1-72) and 
(b) ubiquitin. In both cases, the wild-type form is compared with the 144A 

mutant form of the protein. GST-TcPINK1 does not efficiently phosphorylate 


the 144A mutants of ubiquitin or of the HsPARKIN Ubl domain. This is 
important since the Ie44 patch in the PARKIN Ubl domain is inaccessible and 
binds to RING1 in the structure of full-length RnPARKIN (PDB 4K95 (ref. 16)) 
(see Fig. 4a). c, d, Controls for Fig. 4c. Coomassie-stained gel with proteins 
labelled (c), and full-size blot (d) for Fig. 4c. Molecular weight markers are in 
kDa for all panels. 
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Extended Data Figure 8 | NMR analysis of phosphorylated Ubl. a, BEST- 
TROSY spectra for isotope-labelled HsPARKIN Ubl domain (dark green) and 
phospho-Ubl domain (light green) with resonances assigned for the Ubl 
domain. b, Chemical shift perturbation of Ubl with respect to phospho-Ubl, 
showing significant perturbations in the region of phosphorylation (Ser65), the 
last B-strand and neighbouring f-strands. Grey bars, exchange broadened 
resonances. ¢, Mapping of perturbed resonances onto the previously 


determined NMR structure of HsPARKIN Ubl (PDB LIYF (ref. 36)). The 
perturbed residues cluster in the Ser65-containing loop and in proximity to the 
Ile44 patch of the Ubl. d, Mapping of the perturbed resonances to the structure 
of RnPARKIN (PDB 4K95 (ref. 16)) shows that they perturb the interface 
between the Ub] domain and the PARKIN core. Thus, phosphorylated Ubl may 
not be able to (re)bind PARKIN at the same binding site. 
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Extended Data Figure 9 | PARKIN phosphoUbIl does not compete with 
phosphoUb. a, A possible scenario is that the phosphoUbl competes with 
phosphoUb for the phosphoUb binding site on PARKIN. This could be 
favoured since the interaction would occur in cis. b, Fluorescence polarization 
competition experiment increasing the concentration of Ubl, phosphoUbl 

or phosphoUb with respect to full-length HsPARKIN in the presence of FlAsH- 
tagged phosphoUb. Measurements were performed in triplicate with error 
bars given as standard deviation from the mean. While unlabelled phosphoUb 
competes with labelled phosphoUb in the reaction, Ubl or phosphoUbI do not 
compete with phosphoUb. c, Binding of FIAsH-phosphoUb to HsPARKIN 
and phospho-HsPARKIN. The measurements were performed in the same 


experiment as samples in Fig. 2d. If phosphoUbl interacts with HsPARKIN 
in cis, phosphoUb binding should be inhibited. In contrast, binding of 
phosphoUb to phospho-PARKIN is slightly enhanced as also reported in ref. 7. 
Measurements were performed in triplicate and error bars represent standard 
deviation from the mean. d, PARKIN phosphorylation assays as in Fig. 4c, 
including phosphoUbI as well as phosphoUb (both at 10 uM). While addition 
of phosphoUb induces PARKIN phosphorylation, addition of phosphoUbl 
does not, indicating that the Ubl domain is not released from the PARKIN core. 
The experiment has been performed three times with consistent results. 
Molecular weight markers are in kDa. 
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Extended Data Figure 10 | PhosphoUbl-induced ‘opening’ of PARKIN 
relies on a phosphate pocket in the UPD. a, A second scenario would be that 
the phosphorylated Ubl domain rebinds to PARKIN at an alternative site. We 
previously speculated that the UPD contains a phosphate-binding site that 

is lined by two AR-JP patient mutations, K161N and K211N, and also contains 
Arg163 (ref. 14). b, Full-length HsPARKIN mutants as indicated were 
phosphorylated in vitro with GST-PhPINK1, resolved on SDS-PAGE and 
western blotted using an anti-pSer65 PARKIN antibody (Abcam cat no. 
ab154995). These proteins were used in c. c, Ub-vinyl sulfone (Ub-VS) 
modification of the active site Cys431 of Hs3PARKIN and HsPARKIN mutants 
with and without phosphoUb from a time course experiment is assessed on 
Coomassie-stained gels. Ub-VS reacts with ‘open’ forms of PARKIN, and 
was previously shown to modify phospho-PARKIN but not phosphoUb- 
activated PARKIN’. Phospho-PARKIN-mediated opening depends on the 
phosphate pocket present in the UPD, since phospho-PARKIN K211N, 
K161N (two AR-JP patient mutations) and R163E abrogated or impaired 


modification by Ub-VS and do not appear to have an accessible catalytic Cys, 
while the phosphorylated phosphoUb-binding-deficient mutant K151E is 
readily modified. The experiment was performed two times with consistent 
results and gels have been collated from two different assays (indicated by the 
gap). d, PARKIN ubiquitination reactions in presence of E1, UBE2L3, ubiquitin 
or ubiquitin S65A, ATP and GST-TcPINK1 for 2h. PARKIN is activated by 
phosphoUb or by PARKIN UbI phosphorylation in absence of phosphoUb 
(with ubiquitin S65A) (lanes 2/3). Mutants in the UPD phosphate pocket can 
still be activated by phosphoUb (albeit not to the same extent) (lanes 5, 8, 11) 
but are inactive when the Ubl domain is phosphorylated (lanes 6, 9, 12). 

This could suggest that the phosphoUbl binds back to the PARKIN UPD 
pocket. However, we cannot exclude the possibility that, for example, the linker 
between the Ubl and the UPD plays a more active role in PARKIN activation. 
The experiment was performed three times with consistent results. Molecular 
weight markers are in kDa for b-d. 
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Propagation of conformational changes during 
4-Opioid receptor activation 


Rémy Sounier', Camille Mas', Jan Steyaert?*, Toon Laeremans”* 


Héléne Déeméné? & Sébastien Granier’ 


H-Opioid receptors (uORs) are G-protein-coupled receptors that 
are activated by a structurally diverse spectrum of natural and 
synthetic agonists including endogenous endorphin peptides, 
morphine and methadone. The recent structures of the pOR in 
inactive’ and agonist-induced active states (Huang et al., ref. 2) 
provide snapshots of the receptor at the beginning and end of a 
signalling event, but little is known about the dynamic sequence of 
events that span these two states. Here we use solution-state NMR 
to examine the process of pOR activation using a purified receptor 
(mouse sequence) preparation in an amphiphile membrane-like 
environment. We obtain spectra of the OR in the absence of 
ligand, and in the presence of the high-affinity agonist BU72 alone, 
or with BU72 and a G protein mimetic nanobody. Our results show 
that conformational changes in transmembrane segments 5 and 6 
(TM5 and TM6), which are required for the full engagement of a G 
protein, are almost completely dependent on the presence of both 
the agonist and the G protein mimetic nanobody, revealing a weak 
allosteric coupling between the agonist-binding pocket and the 
G-protein-coupling interface (TM5 and TM6), similar to that 
observed for the B2-adrenergic receptor’. Unexpectedly, in the 
presence of agonist alone, we find larger spectral changes involving 
intracellular loop 1 and helix 8 compared to changes in TM5 and 
TM6. These results suggest that one or both of these domains may 
play a role in the initial interaction with the G protein, and that 
TM5 and TM6 are only engaged later in the process of complex 
formation. The initial interactions between the G protein and 
intracellular loop 1 and/or helix 8 may be involved in G-protein 
coupling specificity, as has been suggested for other family A 
G-protein-coupled receptors. 

HORs are membrane proteins belonging to the G-protein-coupled 
receptor (GPCR) superfamily that are responsible for the analgesic and 
addictive properties of morphine’, a natural alkaloid contained in 
opium’. In fact, due to the clinical importance of opioids in pain 
management and addiction, five decades of medicinal chemistry gen- 
erated a very chemically diverse opioidergic ligand system (reviewed in 
ref. 6) that can modulate the HOR intracellular signalling, namely the 
inhibitory heterotrimeric G; protein and arrestin pathways’. The phar- 
macology of these different compounds are very well characterized and 
they present a wide spectrum of activity with respect to efficacy 
towards G protein and/or arrestin from very weak partial agonism 
to full agonism”*. Evidence suggests that activation of the arrestin 
pathway may be responsible for some of the adverse clinical effects 
of the opioids drugs currently on the market’. 

Intrinsic efficacy is a parameter used to define the effect of a ligand 
on GPCR activation of specific signalling pathways’. To understand 
intrinsic efficacy, it is necessary to understand the process of GPCR 
activation. So far this process has only been described for three GPCRs 
using biophysical and crystallographic analyses of both inactive and 
active states: rhodopsin”, the B2-adrenergic receptor (B2AR)'"’* and 


, Aashish Manglik*, Weijiao Huang“, Brian K. Kobilka‘, 


muscarinic M2 receptor (M2R)"*. The most complete characterization 
of the GPCR activation process by diffusible ligands came from NMR 
studies on the B,AR’*"*, showing that the binding of agonist alone is 
associated with conformational heterogeneity in the receptor and is 
not sufficient to stabilize the active B,AR conformation. Whether this 
observation is generalizable to other GPCRs has not been determined. 

Crystal structures are now available for the OR in both inactive and 
fully agonist-induced states, representing the beginning and end of a 
signalling event (Huang et al., ref. 2). Here, in an effort to better 
understand the process of LOR activation upon binding of opioid 
agonists, we took advantage of the versatility of NMR spectroscopy 
(as recently described for the B.AR”’) to analyse the activation signal 
propagation occurring in different receptor domains (see Extended 
Data Fig. 1 and Methods for further technical details). 

Dimethylamines give peaks with different intensities in the ‘H-'°C 
heteronuclear multiple quantum coherence (HMQC) spectrum of 
unliganded '*C-dimethylated pOR ('*Cm2-HOR) preparations 
(Fig. 1a) that we unambiguously assigned to ten lysines and to the 
shortened N terminus (starting at G52) using systematic mutagenesis 
and proteolysis experiments (Extended Data Fig. 2). It is interesting to 
note the coexistence of broad and irregular peaks (that is, Kos" 
K100°! and K344®°! (ICL1, intracellular loop 1)) with sharp and 
intense signals (K209®*"?, K269°4 (ECL2, extracellular loop 2)) in 
the apo-state. Because a peak line width directly reports on the homo- 
geneity of the local fluctuating magnetic field which depends on the 
dynamics of the labelled domain, this observation qualitatively indi- 
cates that the different domains under investigation (that is, TM6 and 
ICL1-helix 8 (H8)) present different dynamic properties (in the 1s to 
ms range). Indeed, broad peaks suggest that the e-N[’*CH3], moieties 
detect two or more conformations (environments) that exchange on 
an intermediate time scale (low ms), while sharp peaks may represent a 
single conformation or two or more conformations that exchange on 
either a faster or slower time scale. The absence of a signal for a specific 
lysine may represent a lack of side chain dynamics or a dynamic time- 
scale in the ‘slow intermediate’ regime”. We have focused our analysis 
on a specific set of NMR correlation peaks that were separated in two 
groups of sensors: the extracellular (G52 and K209°°!”) and intra- 
cellular (K98 1, K100 1", K260°-%, K269°4, K271°° and K344°°!) 
lysines (Fig. 1b-d). The signals for other probes are very weak 
(K174'C?, K233°4°, K303°°") or even not observable (K141°?° and 
K185**), and spectral modifications are difficult to interpret 
(Extended Data Fig. 2). 

We first analysed the effect of BU72, a high-efficacy WOR agonist”’, 
on the NMR signals using a saturating amount of ligand and/or a 
saturating amount of a G protein mimetic nanobody (Nb33) 
(Fig. 2). We selected BU72 because of its extremely high efficacy/ 
potency and its slower dissociation rate relative to other agonists. 
Moreover, this ligand was successfully used in the crystallization stud- 
ies (Huang et al., ref. 2). Nb33 is closely related to Nb39, the nanobody 
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Figure 1 | HMQC spectrum of unliganded '*C-dimethylated »OR and 
positions of lysine residues in the inactive (cyan blue) and active (orange) 
HOR. a, Position and identity of eleven correlated 'H-'°C peaks corresponding 
to the pOR dimethyllysines and amino terminus. b, Superposition of inactive 
(cyan blue, 4DKL) and active (orange, 5C1M) conformation of 1OR 
highlighting the lysines (sticks) and the e-NH)p (spheres) positioned in the 
extracellular face (yellow box) and in the intracellular face (blue box) of the 
HOR. Also shown are the BU72 ligand (green spheres) and the G protein 
mimetic nanobody position (salmon surface). c, d, Extracellular (c) and 
intracellular (d) view of the LOR showing the reorganization of the lysine side 
chains upon receptor activation from inactive (cyan spheres) to fully active 
(orange spheres). The shortened amino-terminal domain (N terminus) starting 
at G52 was not modelled in the inactive state structure but makes contact with 
the 7TM bundle to cap the binding pocket in the agonist-induced state. 


used to stabilize and capture the OR agonist-induced state in crystal- 
lization studies (Huang et al, ref. 2). Nb33 and Nb39 differ by only two 
amino acids in a region that does not directly interact with the receptor, 
and they have similar G-protein-like effects on the affinity of agonist for 
the WOR. They also have similar effects in NMR experiments (Extended 
Data Fig. 3). We used Nb33 throughout this study because it can be 
purified in much larger quantities than Nb39 (see Methods). 

In the extracellular surface, the peak corresponding to K209*°"” was 
slightly modified upon treatment with BU72 (Fig. 2a, LR panel) and, as 
better observed in the 'H dimension, was decreased in intensity and 
shifted downfield (Fig. 2b, ECL2 panel). As discussed above, this spec- 
tral modification indicates a change in the conformational state of the 
WOR, probably as a result of a modification of the K209"°"’ micro- 
environment, as the B-hairpin containing this residue undergoes a 
change upon receptor activation (Fig. 1c). When the sample was 
treated with a saturating concentration of the Nb33 only, no significant 
changes were observed (Fig. 2a, RNb panel), suggesting an absence of 
environment changes around K209*'* in the absence of agonist. 
When both BU72 and Nb33 were added to the '*Cm2-pOR, the 
K209*°'” peak intensity decreased further as compared to the sample 
treated with BU72 alone (Fig. 2a, LRNb panel and Fig. 2b, ECL2 panel). 
We also observed changes in the N terminus upon binding of BU72 
alone; however, in this case the peak intensity increases. BU72 and 
Nb33 co-treatment slightly increased the effect observed for BU72 
alone (Fig. 2b, N-terminal panel). It is of interest that the active-state 
crystal structure of the OR reveals that the N terminus folds into the 
binding pocket and interacts with BU72 (Huang et al., ref. 2). 

The potentiating effects of Nb33 on BU72 are much more pro- 
nounced in the intracellular part of '*Cm2-OR, as BU72 or Nb33 
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Figure 2 | Activation of tOR by BU72. a, HMQC spectra of unliganded °C- 
dimethylated LOR (30 |1M, R, black), 3C-dimethylated HOR treated with a 
saturating concentration of BU72 (150 uM, LR, blue), 5C-dimethylated HOR 
treated with BU72 and a saturating concentration of Nb33 (60 1M, LRNb, 
green) and of unliganded ‘*C-dimethylated OR treated with a saturating 
concentration of Nb33 (60 1M, RNb, orange). The small crosses indicate the 
position of peaks visible in the apo-state that are reduced in intensity or no 
longer visible in the other spectra. Arrows indicate the position of the 1D 
slice represented in b. b, 1D slice of HMQC spectra in the 'H dimension 
highlighting the effect of BU72 alone (blue) and BU72 + Nb33 (green) on the 
N terminus (G52) and on the indicated set of dimethyllysines (ECL2, TM5, 
TM6, ICL1 and H8 lysines). The yellow and blue boxes indicate the positions 
in the extracellular and intracellular domains, respectively. The zoom level 
for each peak and the '°C chemical shifts are indicated in the top left of each 
panel. Asterisks indicate the chemical shifts of methyl probes in the presence 
of both BU72 and Nb33. 


induced only small spectral changes by themselves (Fig. 2a, LR and 
RNb panels), while formation of the ternary complex led to more 
dramatic spectral modifications (Fig. 2a, LRNb panel). The proton 
peak centred around 2.9 p.p.m., which corresponds to the lysines 
K269°*4, K271°7° and K303°** positioned in TM6, almost disap- 
peared (the leftover peak being K303°°*, see assignments in 
Extended Data Fig. 2) with a concomitant appearance of an intense 
peak at 2.85 p.p.m. due to K269°** and K271°”° alone (Fig. 2b, TM6 
panel). This spectral shift, which is not observed in spectra from 
receptor treated with BU72 alone, is consistent with the 10 A outward 
movement of TM6 upon receptor activation that repositions the 
K269°*4 and K271°”® side chains (Fig. 1d). The K260°°° peak intensity 
is also markedly decreased by exposure to both BU72 and Nb33, and 
thus represents a sensor for the receptor conformational change in 
TM5 (Fig. 2b, TM5 panel). These results show that binding of the 
high-affinity agonist BU72 alone is not sufficient to fully stabilize the 
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Figure 3 | Quantitative analysis of spectral changes in intracellular domains 
of pOR and a structural interpretation. a, b, Percentage of maximal effect in 
the volume of indicated peaks in sample treated with agonist alone (BU72, 
[Dmt']DALDA and DAMGO) (a) or agonists together with Nb33 (b), 
highlighting the stronger effect of agonist in absence of Nb33 on the ICL1 and 
H8 domains as compared to TM5 and TM6 domains. Error bars represent the 
uncertainty in volume determination due to random noise (see Methods). 

c, Model proposing the propagation of conformational changes from the 
agonist-binding pocket to the ICL1 and H8 domains initiating interactions with 
the G protein G; (represented by the «5 C-terminal domain of G;, yellow circle) 
(middle panel) before the fully agonist-induced state of the tOR is reached 
(bottom panel), with the direct interaction of the G;,,5 with TM5 and TM6. 
Numbered circles represent the identity and the position of the TM domains 
viewed from the intracellular side of the membrane. 


active conformations of TM5 and TM6 and support the notion of a 
weak connection between the ligand-binding and the G-protein-coup- 
ling domains of the LOR. This weak coupling was previously observed 
for the B,AR, and was thoroughly investigated more recently using 
NMR and DEER spectroscopy’. Of interest, the results obtained with 
the BAR and the LOR stand in contrast to rhodopsin, for which a 
stronger link between photoisomerization of retinal and structural 
changes at the cytoplasmic end of TM6 has been observed”. 

In addition to the TM5 and TM6 probes, we also characterized the 
conformational transitions in the ICL1 (K98'"! and K100'") and 
H8 domains (K344°°') of OR that also undergo conformational 
changes upon LOR activation (Fig. 1d). Comparison of the R panel 
with the LRNb panel in Fig. 2a revealed a near complete loss of intens- 
ity of K98'“", K100'"" and K344°*! peaks upon OR treatment with 
BU72 and Nb33 (Fig. 2a, b, ICL1/H8 panels, green trace). While a 
complete loss of intensity for these peaks was only observed with both 
BU72 and Nb33, we did observe a reduction in intensity with BU72 
alone (Fig. 2b, ICL1/H8 panels, blue trace). The loss of peak intensity 
could originate from a change in the local dynamic of ICL1 and H8 
domains in the [1s to ms timescale that broadens the correlation peaks 
beyond the detection limit, a well characterized phenomenon in NMR 
spectroscopy”, and suggests that receptor activation is markedly 
modifying the structural dynamic of these domains. It should be noted 
that none of the lysines discussed here directly interact with the related 
Nb39 in the crystal structure. Changes in the NMR signal intensity 
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therefore reflect the LOR activation process. Moreover, the effects of 
agonist and Nb33 can be reversed by treatment with the antagonist 
naloxone (Extended Data Fig. 4). 

Taken together our data demonstrate that binding of the G protein 
surrogate is required for the agonist to fully stabilize the active con- 
formation in both extracellular and intracellular domains, suggesting 
the existence of a two-way allosteric coupling between the extracellular 
HOR ligand-binding domain and the G-protein-coupling interface. 
These data are in agreement with what has been observed with the 
isotopically labelled B,AR™, and suggest that this behaviour could be a 
common feature of the GPCR activation process. 

We next quantitatively analysed the effect of BU72 alone (LR) and 
together with Nb33 (LRNb) on the volumes of the different intracel- 
lular lysine peaks (from TM5, TM6, ICL1 and H8 domains). Of par- 
ticular interest, we found a notable difference in the effect of BU72 
alone on TM5 and TM6 compared to its effect on ICL1 and H8 
domains (Fig. 3a). While BU72 alone had a slight effect on TM5 and 
TM6 signal intensity, it produced larger spectral changes for the ICL1 
and H8 sensors (Fig. 3a), suggesting that the allosteric coupling from 
the agonist-binding pocket to ICL1/H8 is stronger than the coupling to 
TM5 and TM6. This effect is not specific to BU72 as we observed the 
same phenomenon with two other agonists: [Dmt']DALDA (2’,6'- 
dimethyltyrosine-b-Arg-Phe-Lys-NH2) and DAMGO (Tyr-p-Ala- 
Gly-NMe-Phe-Gly-ol) (Fig. 3a and Extended Data Figs 5 and 6). 
Notably, the formation of the ternary complex leads to larger volume 
changes for the different domains under investigation (Fig. 3b) and, 
together with Fig. 3a, demonstrates that the fully agonist-induced state 
can only be achieved in the presence of an agonist and a G protein 
mimetic nanobody. 

While we observe that agonists alone have larger effects on ICL1 
and H8 reporters than on TM5 and TM6 reporters, our steady-state 
experiments do not provide information about the temporal sequence 
of these conformational changes; however, we speculate that a G 
protein may first engage ICL1 and/or H8 before engaging TM5 
and TM6. In fact, the H8 domain is critical for G-protein coupling, 
as observed in rhodopsin™* and for other GPCRs (see ref. 25). 
Notably, the H8 of the muscarinic M3R was shown to cross-link with 
the Gyq protein in an agonist-dependent manner”, suggesting a close 
proximity between the receptor H8 domain and the G protein. Our 
results support a model where G protein could initially interact with 
the H8 and/or ICL1 domains of the receptor before docking in the 
7TM core and stabilizing the TM5 and TM6 domains in an open 
conformation (Fig. 3c). 

Here we describe the measurement of conformational changes 
associated with HOR activation in both the extracellular and intracel- 
lular domains. Our data offer insights into agonist regulation of WOR 
subdomain conformations, revealing a possible allosteric pathway of 
activation from the agonist-binding pocket to the ICL1 and H8 
domains to initiate the ternary complex formation and associated 
changes in TM5 and TM6. Importantly, our data complement the 
high-resolution structural data of the WOR agonist-induced state 
bound to BU72 described in a companion article (Huang et al. 
ref. 2) and offer unprecedented insights into the structural mechan- 
isms of opioid receptor activation. A better understanding of the 
structural basis for opioid receptor activation may lead to new thera- 
peutic approaches with fewer side effects. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized. The investigators were not blinded to allocation during 
experiments and outcome assessment. 

Protein expression and purification. Nanobody (Nb) sequences were subcloned 
into a pMalp2x vector containing an N-terminal, 3C protease-cleavable maltose 
binding protein (MBP) tag and a C-terminal 8 X His tag. Plasmids were trans- 
formed into BL21(DE3) cells and protein expression induced in lysogeny broth 
(LB) by addition of IPTG to 1 mM at an ODgop of 0.8. After 24h of incubation at 
20 °C, cells were collected and sonicated in the presence of lysozyme and benzo- 
nase. MBP-nanobody fusions were purified by Ni-NTA chromatography and 
MBP was removed using 3C protease. Cleaved MBP was separated from the 
nanobody by additional amylose purification and size exclusion chromatography 
in a buffer containing 20 mM HEPES pH 7.5 and 0.1 M NaCl. Of note, because we 
needed large amounts of Nb for the NMR experiments, we decided to use Nb33 
throughout this study as it was produced and purified in much larger quantities 
than Nb39 (almost ten times as much for the same volume of BL21(DE3) cell 
culture). 

We generated a LOR mouse construct with features designed to enhance 
stability for NMR spectroscopy. A tobacco etch virus (TEV) protease recognition 
site was introduced after residue 51, and a human rhinovirus 3C protease site after 
residue 358. A Flag epitope tag was added to the amino terminus and an 8 X His 
tag was appended to the carboxy terminus. Receptor expression was largely 
improved by using a M72T single-point mutation as assessed by naloxone binding. 
The final construct ({41OR-2x) is shown in Extended Data Fig. 1. 

The OR-2x construct was expressed in Sf9 cells using the Bestbac (Expression 
systems) baculovirus system in the presence of 31M naloxone. For all other 
mutants (assignments), we used the bac-to-bac system (Life Technologies). Sf9 
cell cultures (Life Technologies) were grown to a density of 4 X 10° cells per ml, 
infected with baculovirus containing the 1;OR-2x genes, shaken at 27 °C for 48 h, 
and cell pellets were harvested and stored at —80 °C. The receptor was purified as 
previously described’ and eluted from the anti-Flag M1 affinity resin with a buffer 
containing 0.01% MNG, 0.001% CHS, 20 mM HEPES pH 7.5, 0.1 M NaCl, 1 1M 
naloxone, 0.2 mg ml ' Flag peptide and 2mM EDTA. 

To remove flexible amino and carboxy termini, TEV and 3C protease were 

added at a 1:5 and 1:10 protease:,OR-2x ratio by weight. The sample was incu- 
bated at 4°C overnight in the presence of 100}1M of TCEP. We then used a 
negative Ni-NTA chromatography step to remove TEV and 3C proteases. 
HOR reductive methylation. Receptor preparation from the Ni-NTA flow 
through were incubated at 4°C overnight with 10mM '*C-formaldehyde and 
10mM NaBH;CN. Excess of reagent was eliminated by dialysis and ‘*Cm2- 
HOR was further purified by SEC chromatography in a buffer containing 0.01% 
MNG, 0.001% CHS, 20 mM HEPES pH 7.5 and 0.1 M NaCl. 

The monodisperse peak was then concentrated to 20 to 40 tM final, and dia- 
lysed in 98.85% DO buffer with 0.01% MNG, 0.001% CHS, 20 mM HEPES-d18 
pH7.4 (uncorrected) and 40 mM NaCl. 

NMR spectroscopy. Final samples (~270 ul at 20-40 11M) were loaded into 
Shigemi microtubes susceptibility matched to D,O. All data for ligands and 
mutant studies were acquired on 700 MHz and 500 MHz Bruker Avance III spec- 
trometers, respectively (Bruker, Rheinstetten, Germany), both equipped with 
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5 mm cryogenic H/C/N/D probes with zaxis gradient. 'H-'*C correlation spectra 
were recorded using heteronuclear multiple-quantum coherence (HMQC) experi- 
ments in echo/anti-echo mode. °C and 'H chemical shifts and peak line widths in 
the HMQC spectra reveal the chemical and magnetic environments of the *C- 
methyl probes as well as their dynamic properties. Spectral widths in @1 and 2 
were 8,417.5 Hz (6,009.6 Hz) and 3,519.6 Hz (2,515.2 Hz) at 700 MHz (500 MHz), 
respectively centred at 40 p.p.m. in the ‘°C dimension. ‘°C decoupling was per- 
formed with a GARP4 sequence. Typically, 134 (81) complex points with 32-48 
scans per FID were recorded, to ensure a 27-Hz resolution per point at 700 MHz 
(500 MHz) before zero filling. The relaxation delay was set to 1.5s. Thirty-two 
steady-state scans preceded data acquisition. Total collection time varied between 
3 and 4h, depending on the sample concentration. Spectra were processed using 
NMRPipe/NMRDraw software” and visualized using CCPNMR”*. 

All ligands were dissolved in deuterated DMSO to 10 mM and directly added to 

the sample in the Shigemi tube at a final concentration of 150 uM. Nb33 and Nb39 
were concentrated to 0.6 mM and dialysed in 100% D,O buffer with 0.01% MNG, 
0.001% CHS, 20mM HEPES-d18 pH 7.4 (uncorrected) and 40 mM NaCl. The 
nanobodies were added directly in the Shigemi tubes at a final concentration of 
60 1M before data acquisition. 
Peak volume analysis. All NMR spectra were processed using the suite of pro- 
grams provided in the NMRPipe/NMRDraw software distribution’’. The spectra 
were normalized using DSS (2,2-dimethyl-2-silapentane-5-sulfonic acid) as an 
internal reference. For peak fitting analysis, spectra were processed with a squared 
sine-bell window function in each dimension, and zero-filled to 4096 * 1024 data 
points in time domain data ft, and f,, respectively. Spectra were fit with the pro- 
gram nlinLS, provided as a part of nmrDraw package. Gaussian models were used 
for the fitting in each dimension, starting from values obtained from the peak- 
picking routine in nmrDraw. The quality of the fits was examined visually by 
estimating the residual difference between the experimental data and the results 
of the model calculations. Peak volumes in apo-state, ligands alone (BU72, 
[Dmt']DALDA, and DAMGO) and ternary complexes spectra were extracted 
from the peak fitting. Errors in the peak volume were calculated based on the 
effect of random noise for the peak height estimated by nlinLS. 

To directly compare the ligand effects on the apo-state relative to the maximal 
effect obtained with BU72-Nb33 treatment (fully agonist-induced state), we nor- 
malized the peak volumes with the volume difference between the apo-state and 
ternary complex condition (BU72-Nb33) as the 100% (Vapo — Vgu72-nb33)- 
Ligand effects are represented as percentage of the maximal effect: 


Vapo — Vi or L—Nb33 


100 x . For the TM6 data, because of the overlapping peaks 


Vapo — Vpu72—Nb33 
in the area of interest, we considered the peak volume of K269°4 and K271°”° in 
the ternary complex with BU72 as the 100% (indicated as K269* and K271* in 


V = 
Fig. 2) and in this case the percent of maximal effect is 100 x J USELE NS The 


BU72—Nb33 
uncertainty was determined by V AV, or L-Nb33” + AVapo” + A Vgu72—Nb33”- 


27. Delaglio, F. et al. NMRPipe: a multidimensional spectral processing system based 
on UNIX pipes. J. Biomol. NMR 6, 277-293 (1995). 

28. Vranken, W.F. etal. The CCPN data model for NMR spectroscopy: development of 
a software pipeline. Proteins 59, 687-696 (2005). 
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Extended Data Figure 1 | Overall strategy for the preparation of '°C- 
dimethylated pOR and quality control. The wild-type OR contains a total of 
12 lysine residues strategically positioned to sense conformational dynamics 
in both extracellular and G-protein-coupling domains upon receptor 
activation. To monitor the HOR activation process in solution in an amphiphile 
membrane-like environment, we exploit the sensitivity of two '*C-methyl 
groups covalently bound to the s-NHp of lysine side chains (¢-N[’*CH3]>- 
lysines) and to the o1-NH) of the receptor N terminus as NMR probes, an 
approach recently validated in the GPCR field’. We slightly modified the 
wild-type sequence to facilitate receptor purification and to remove flexible N 
and C termini for improvement of sample stability in NMR measurements. 
a, Snake plot presenting the 1OR-2x construct and highlighting the 12 
endogenous lysines (red circles), the protease cleavable motifs (black circles) 
and the Flag and 6 X histidine tags (grey circle). Extracellular and intracellular 
surfaces are coloured yellow and blue, respectively. b, The different 
biochemical steps were analysed by SDS-PAGE (left panel) and a typical 
final size-exclusion chromatography highlighting the monodispersity of 
'3C-dimethylated OR is shown (right panel). c, Shown is the reductive methyl- 
ation of the N terminus and of lysine side chains after reaction with 
'8C-formaldehyde ("°C in green) in the presence of sodium cyanoborohydride 
(NaBH3CN). Amino acids are shown in ball-and-stick representation and, for 
clarity, side-chains of unmodified amino acids are replaced by large spheres. 
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This reaction is known to minimally affect protein structure and function and, 
as previously observed with B2-AR (see ref. 19), it did not affect the 
functionality of }OR as '°C-dimethylated LOR binds both antagonist 
(Naloxone) and agonist (BU72) with a similar affinity than unlabelled 1}OR 
(d). The left panel in d represents the mean = s.d. of triplicates for one 
experiment representative of three experiments in total (n = 3). The right 
panel in d represents normalized values + s.d. (larger binding value being 100% 
and smaller 0%) for one experiment representative of three experiments in total 
(n = 3). Kg and ICso values are the mean + s.d. of the three indepen- 

dent experiments for both receptor preparations. In addition, we observed 
similar agonist-induced interaction with Nb39 between OR or '°C- 
dimethylated OR using pull-down experiments (e), demonstrating that 
methylation does not prevent the agonist-dependent interaction with Nb39. 
Saturation binding experiments on soluble HOR or '*C-dimethylated pOR 
(both at 100 nM) were done in the presence of an increasing amount of 
radiolabelled naloxone (up to 1 uM) and non-specific binding was determined 
in the presence of 100 LM of naloxone. Competition assays on soluble HOR or 
'8C-dimethylated OR were done with 100 nM of radiolabelled naloxone 
and increasing amount of BU72. Free and receptor-bound radioligands were 
separated using gel-filtration columns. Total binding was plotted as a function 
of [?H]-naloxone or BU72 concentration and data were analysed using 
Prism with saturation or competitive binding analyses. 
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Extended Data Figure 2 | Assignments of the N terminus and 
dimethyllysine peaks. Each single mutant was expressed, purified and labelled 
as described for WOR-2x. For each mutant, we analysed the R and LRNb 
spectra. a-k, Left panels represent the 2D spectra with the indicated lysine to 
arginine mutant in red. On the right panels, we highlight the peak 
disappearance for each mutant (red line) as compared to the wild-type hOR 
preparation (black line) in the 'H dimension in the receptor alone condition (R). 
In some instances, peaks are overlapping in the area of interest and to assign 
these peaks we had to record the NMR spectra in the ternary complex situation 
(LRNb). This was the case for K269 (h), K271 (i), and K303 (j). In h, i and j, 
# indicates the position of the lysine peak from the Flag tag that we measured in 
some samples and that is due to an incomplete cleavage of the N terminus by the 
3C protein. For samples used in the analysis of ligand and Nb effects, we almost 
eliminated this peak by optimizing the 3C cleavage step as described in the 
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Methods (treatment with 100 1M TCEP). The peak denoted with a # in panel 
i between wild type (no TCEP) and K271R (100 1M TCEP) is an example of 
the improvement of 3C cleavage in TCEP-treated samples as indicated by the 
almost complete disappearance of the peak in both R and LRNb conditions 
(K271R, red line). The K98 peak was assigned by deduction (Fig. 1a). Indeed, we 
generated and recorded NMR spectra of 11 mutants of the 12 endogenous 
lysines and the only peaks that were never affected in all the mutant spectra and 
left unassigned necessarily correspond to the K98 residue. 1, The assignments 
of D1 (full-length N terminus) and G52 (cleaved N terminus) were inferred from 
the spectra obtained before (red line) and after cleavage with 3C (black line). 
The G52 loss of signal is better observed in the LRNb situation (G52* almost 
completely disappears). Asterisks indicate the chemical shifts of methyl probes 
in the ternary complex condition. 
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Extended Data Figure 3 | Comparison of Nb33 and Nb39 effects on NMR __ the fully agonist-induced state of pOR. 1D slice of HMQC spectra in the 'H 
spectra. HMQC spectrum of '*C-dimethylated OR in the ternary complex _ dimension (right panels) highlighting the similar effect of Nb33 and Nb39 in 
situation (R, 30 uM; BU72, 150 uM; Nb, 60 uM) for Nb33 acquired with a the presence of BU72 on the indicated set of dimethyllysines and compared to 
700 MHz spectrometer (blue trace) and for Nb39 acquired with a 500 MHz the untreated sample (black dotted line). 

spectrometer (red trace). Both spectra are very similar and are characteristic to 
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Extended Data Figure 4 | Partial reversal of agonist effects by treatment (60 {tM, LRNb, [Dmt']DALDA (green) or naloxone (light pink) (c). d, Spectra 
with the antagonist naloxone. a-c, HMQC spectra of unliganded 3G: of the [Dmt']DALDA-RNb sample treated with naloxone (900 11M) (L°RNb, 
dimethylated ,OR (30 1M, R, black) (a), the same sample bound toa saturating magenta). Green arrows in 1D traces highlight the peaks movement in the 
concentration of ligand (150 1M, LR, [Dmt']-DALDA (blue) or naloxone (red) _activation process (c) and for the partial reversal effect of naloxone (d) for K209 
(b) and the same sample treated with a saturating concentration of Nb33 and K344/K98 peaks. 
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N terminal 
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Extended Data Figure 5 | Activation of »OR by [Dmt']DALDA. a, HMQC 
spectra of unliganded 13 C-dimethylated LOR (30 1M, R, black), BC. 
dimethylated OR treated with a saturating concentration of [Dmt']DALDA 
(150 uM, LR, blue), 3C-dimethylated WOR treated with [Dmt']DALDA anda 
saturating concentration of Nb33 (60 LM, LRNb, green) and of unliganded *C- 
dimethylated }OR treated with a saturating concentration of Nb33 (60 4M, 
RNb, orange). The small crosses indicate the position of peaks visible in the 
apo-state that are reduced in intensity or no longer visible in the other spectra. 


45.27 p.p.m. 
x6 


Arrows indicate the position of the 1D slice represented in b. b, 1D slice of 
HMQC spectra in the 'H dimension highlighting the effect of [Dmt']DALDA 
(blue) and [Dmt']DALDA + Nb33 (green) on the N terminus (G52) and 

on the indicated set of dimethyllysines (ECL2, TM5, TM6, ICL1 and H8 
lysines). The yellow and blue boxes indicate the positions in the extracellular 
and intracellular domains respectively. The zoom level for each peak and the 
°C chemical shifts are indicated in the top left of each panel. Asterisks indicate 
the chemical shifts of methyl probes in the ternary complex condition. 
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LETTER 


N terminal 
46.79 p.p.m. 
x2 


Extended Data Figure 6 | Activation of OR by DAMGO. a, HMQC spectra 
of unliganded a C-dimethylated LOR (30 [1M, R, black), 2 C-dimethylated WOR 
treated with a saturating concentration of DAMGO (150 1M, LR, blue), 1G: 
dimethylated OR treated with DAMGO and a saturating concentration of 
Nb33 (60 1M, LRNb, green) and of unliganded '*C-dimethylated LOR treated 
with a saturating concentration of Nb33 (60 1M, RNb, orange). The small 
crosses indicate the position of peaks visible in the apo-state that are reduced in 
intensity or no longer visible in the other spectra. Arrows indicate the 


45.27 p.p.m. 
x6 


position of the 1D slice represented in b. b, 1D slice of HMQC spectra in 

the ‘H dimension highlighting the effect of DAMGO alone (blue) and 
DAMGO + Nb33 (green) on the N terminus (G52) and on indicated set of 
dimethyllysines (ECL2, TM5, TM6, ICL] and H8 lysines). The yellow and blue 
boxes indicate the positions in the extracellular and intracellular domains 
respectively. The zoom level for each peak and the '°C chemical shifts are 
indicated in the top left of each panel. Asterisks indicate the chemical shifts of 
methyl probes in the ternary complex condition. 
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The dream budget 


The key to astrong grant application is a well-considered and detailed outline of 
expenses, says grant director Ingrid Eisenstadter. 


grant proposals from scientists who are 

seeking support for their work. I am the 
director of grants for a small family foundation 
that has supported research for more than 
half a century. My desk is the first step in a 
process that will determine the fate of these 
funding requests. I start by reading the pro- 
posal’s title and the introductory summary 
of its significance. 

Then, I turn to the budget page. 


S everal times a year, my inbox fills with 


The first thing I look at is the bottom line. If 
the full request is in our range, I skim through 
the rest with an eye for potential red flags. For 
example, we have a ceiling on overhead requests: 
we do not support operating funds or proposals 
that are primarily for equipment. Only after I 
have assessed whether there are issues with the 
budget do I return to the proposal itself. 

When drafting a budget, it is crucial to know 
the maximum grant amount awarded by the 
foundation to which you are applying. If you 


ask for much more, the granter might reject your 
proposal on that basis alone. Some organizations 
avoid publishing an exact figure to discourage 
applicants from reflexively requesting it. How 
can you find out? In the United States, the 
Foundation Center, a national database for 
grant-seekers, provides access to the ‘Form 990° 
tax return that all foundations are required to 
submit to the federal government (see go.nature. 
com/panexz). These returns disclose the dollar 
range of grants issued by a foundation each > 
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> year. You should also look at the granters’ 
websites. Most do not post their 990 forms, but 
some have annual reports or web pages that list 
project titles or recipients and the correspond- 
ing grants’ dollar amounts. Be aware that when 
granters see budgets, for example, for exactly 
US$25,000 or $50,000, many will suspect that 
aspects of the budget were contrived to reach 
this exact amount. Proceed with caution. 

It is also important to find out whether a 
foundation has specific requirements for the 
budget section of grant applications — and, if 
so, to observe them strictly. Many do, and they 
typically post this information on their web- 
sites along with their application guidelines. 
These rules will tell you whether you can, for 
instance, purchase equipment, support under- 
graduate assistants or request publication fees 
for journal articles that may or may not arise 
from your work. They also specify whether 
there is a ceiling for overhead — or ‘indirect’ 
— expenses. With our limited resources as 
granters, we want to fund research, not depre- 
ciation and administration, so if an institution 
submits an application that proposes adding 
50-70% — or even more — to the bottom line 
for overhead expenses, that can be a problem. 

The budgeting rules will also tell you whether 
you can ask for general operating support or 
only specific project-related funds, and whether 
the funder will award multiple-year grants. Find 
out before you start budgeting. If your questions 
are not answered on the website, send an e-mail 
to the foundation to ask for clarification. There 


is nothing unusual about these enquiries. 

The majority of the budget requests that 
we receive are indeed for our maximum grant 
amount, and many staff members from other 
grant-giving foundations have told me the same 
thing. Or, put another way, they have the same 
complaint. The place where we tend most often 
to see this escalation is in the budget’s salary line. 


SALARY SAMPLES 

We received an e-mail proposal recently that 
requested US$14,550, with more than half of 
that allocated to salary. When the hard copy 
of that proposal arrived, however, it looked 
as though the applicant had accessed our 
990 form, because the salary allocation — in 
an otherwise unchanged proposal — had 
more than doubled. The applicant had added 
months’ worth of support, raising the total 
request to $24,550. 

We also received a proposal in which the 
principal investigator indicated that he had 
asked a second funder for $15,000, in addition 
to the $25,000 that he was seeking from us. I 
asked to see that second budget and discovered 
that he would actually require an extra $1,700 
if he received only the smaller grant. That is, 
a total of only $16,700 would have allowed 
this preliminary investigation to proceed — 
in the absence of any salary support at all. He 
would have had to scale back his project, but its 
successful completion would have nonetheless 
enabled him to make a larger request for state 
or federal funding. 


WALK THE LINE 


Explain your budget 


Although not all granters specify that 

they want to see a page entitled ‘Budget 
justification’, itis wise to include one. This 
is a line-by-line explanation of itemized 
expenses in your request. Usually, just a few 
sentences are sufficient to address each 
expenditure. Also, keep in mind that your 
proposal could be going to a foundation 
whose evaluators are not trained in your 
speciality. To the extent possible, your 
budget justification should be written in 
language that is clear to a layperson. 

A good place to start is the salary line. 
We often see a one- or two-month salary 
request for projects that will last a year. 
Explain this. For instance, one common 
reason for the apparent discrepancy is that 
principal investigators devote just part of 
their time during the year to the proposed 
projects, so the request would cover 
100% of salary support for the work. In 
other instances, home institutions might 
pay staff researchers’ salaries during the 
academic year and expect them to raise 
salary funds only for the summer months. 


Whatever the reason, you need to clarify. If 
you are asking for a salary or stipends for 
undergraduate assistants or PhD students, 
include in your budget an hourly or daily 
rate or the percentage of time that they 

will devote to the project, so that granters 
understand the exact time commitment that 
they are being asked to fund. 

If you need to pay for supplies or tests, 
name them and explain how they will be 
used. The same goes for an equipment line. 
Explain why you need a new incubator or 
how you will use the desiccator, centrifuge 
or imaging equipment. If you are purchasing 
equipment that will outlive your research 
project, consider briefly explaining how 
you will use it in the future, so that the 
foundation understands the lasting benefits 
of its long-term investment. If you are asking 
for food and lodging support for travel, list a 
daily expense that explains the total. 

When you neglect to provide these 
explanations, you are asking evaluators to 
fish through your text for details about each 
budget line. This is not to your advantage. L.E. 
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Although both of these researchers should 
have explained their position, I did not 
consider either case to be an act of duplicity 
(we funded one of them). An applicant’s use 
of the salary line to increase a grant request is 
understandable to — if not welcomed by — 
the granter. There is prestige for a researcher 
in obtaining large grants, and some institutions 
do require staff to bring in salary support. 

You should visit your development office 
early in the budgeting process to talk to staff 
members and learn what they expect. Most 
universities and non-governmental organiza- 

tions are prepared 


“Tf agrant to reduce their over- 
application head requirements 
has sparkle to adhere to founda- 
— novelty and tions’ guidelines. We 
athoughtful often see budgets 
protocol — it in which the hie 

institution produces 
a ie matching funds to 


enable its researchers 
to stay within our 
support guidelines. If this is the case, consider 
adding that information to your budget page 
in acolumn labelled ‘matching funds or ‘other 
sources’ to show that your request is backed bya 
contribution to salary, fringe benefits, overhead 
expenses or any other support your home insti- 
tution is willing to provide. 

You might also want to include a column for 
other sources when you are applying to two or 
more granters for the same project. Here, it is 
important to explain in your budget justifica- 
tion whether the funds from these sources are 
already in hand (see ‘Explain your budget’). If 
not, you need to clarify whether the project 
will still proceed — perhaps ona smaller scale 
— ifthe other organizations fail to contribute. 
State in your application when you will know 
about any other funds. The timing is important 
because a granter will be reluctant to write you 
acheque if your project runs the risk of halting 
midway for lack of that support. 

Too much information in a budget is better 
than too little. On my desk recently was a pro- 
posal from a principal investigator who wanted 
to travel to South America — with her dog. 
This added, yes, 13 lines to her budget, ranging 
from $30 for a flea-and-tick collar to $400 for a 
canine plane ticket. The researcher explained 
that her dog is not merely her travel compan- 
ion but also a well-trained scat-detection dog 
—a necessity for her research. We funded this 
interspecies team. 

If a grant application has sparkle — novelty 
and a thoughtful protocol — it could persuade 
us to work with the researcher to reduce the 
final request to better fit our resources. Or we 
may even agree to write a cheque that is larger 
than usual. m 


Ingrid Eisenstadter is director of grants for 
The Eppley Foundation for Research in New 
York City. 


Ua SCIENCE FICTION 


IN ANEW LIGHT 


BY S. R. ALGERNON 


CC Lucifer, said Ned Goodman, 

() sotto voce, as he knelt at a stone 

altar in a clearing far from town. 

“Take corporeal form. Appear before me. 

Offer me your power and I will do as you 
command.” 

Goodman placed his hands on an iron- 
bound codex that rested on the altar. Soon, he 
thought, there will be no more weekends run- 
ning focus groups for toothpaste and deodorant. 
Soon, he would wield real power. 

Goodman had studied every known 
ancient language, but the markings on the 
pages meant nothing to him. The phonetic 
transcription — on foolscap and weighed 
down with a rock — had come from a 
deranged monk ofa disgraced order. He had 
said — among his more lucid ravings — that 
Hephaestus had forged the codex aeons ago. 

Goodman read the incantation. As he did, 
the earth shook, and the ground on the other 
side of the altar collapsed into a molten pool. 
Stray tufts of grass near its edge caught fire. 
The air stank of sulfur, and it stung Good- 
mans eyes. A brilliant column of liquid metal 
oozed up from the pool, taking on a red glow 
as it cooled. It ballooned out on top like a 
mushroom or a jellyfish. Flames erupted 
from its surface, as if the protuberance wore 
a corona or a ring of horns. 

“Are you Lucifer?” asked Goodman. 

“You might say I am a servant thereof” 
The glowing form grinned, revealing a 
tongue of flame behind pointed teeth. It 
extended a hand, sheathed in flame. 

“Follow me. Lucifer awaits.” 

“Follow you? Where?” 

“To a world of fire. You shall bathe in acid 
under the weight ofa stormy, sulfurous sky.’ 

Goodman shrank back. 

“I know you serve the Prince of Darkness 
and all? said Goodman, “and I don't mean to 
be impertinent, but aren't you supposed to 
grant me fearsome powers before dragging 
me off to perdition?” 

“Powers?” The agent of Lucifer’s smile 
faded. “We have geothermal power, if that’s 
what you mean” 

“Aren't I supposed to get a chance to do 
some sinning first?” 

“Are you that monk that’s been bothering 
us? Look. We told you last time...” 

Discordant howls from beyond the portal 
drowned out the rest of the reply. A spray of 
shimmering metal drenched the agent from 
the right side, dulling the glow of its horns 


Career opportunities. 


momentarily. A few stray droplets landed on 
the altar, setting the foolscap alight. 

“Hey, you two! No splashing ...! Where was 
I? Look, I don't have time to argue with you. 
Did you request a portal to Lucifer or not?” 

“T did? 

“Lucifer? The Morning Star?” 

pe 

“The Evening Star?” 

“Ts that a tabloid or something?” 

“Venus?” 

“Are you telling me that that molten glob 
and those choking fumes are from Venus?” 

“What else would they be?” 

What good was Venus? thought Goodman. 
He recalled the people who had mocked 
his quest over the years and considered 
throwing them into the portal. That would 
certainly be devilish, but he couldn't quite 
muster a thirst for vengeance. He had been 
so busy tracking down the codex that he 
hadn't had time to acquire any real enemies. 

“But what about the contract signed in 
blood?” asked Goodman. “What about the 
pitchforks and tormented souls?” 

“I... dont really...” said the agent of Luci- 
fer. “Could you hold for just a minute?” 

Goodman watched the ground smoulder, 
wondering if he should just take the codex 
and go home. An iron beach ball bounced 
by within the swirling haze. Vague shapes 

galloped after it. 


> NATURE.COM “So sorry to keep 
Follow Futures: you waiting,’ said the 
Y @NatureFutures agent of Lucifer upon 


Ei go.nature.com/mtoodm its return. “We closed 
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down our Earth operations centuries ago, 
including the guest-worker programme.” The 
agent glanced down at the codex. “It seems 
that your information is out of date” 

Goodman sighed and then coughed asa 
stray plume of carbon dioxide extinguished 
a smouldering corner of the notes. 

“You're saying you don't want my soul?” 

“Tm afraid not. Look, I know we should 
have been clearer about the policy changes, 
and I hate to see you leave empty-handed” 

The agent’s eyes brightened to a pleasant 
shade of orange. “I know! Some of our histori- 
cal Earth clients have requested heavy metals. 
Our factories had a cancelled order from the 
Ishtar Terra Children’s Marching Band. Could 
I interest you in an oboe made of osmium?” 

“No, I don’t think...” 

“A tuba made of tungsten?” 

“No. You really don’t want my soul? I fig- 
ured souls would still be marketable in this day 
and age. There's so much evil in the world” 

“Times change. Back then, we needed 
someone to help us bring organics and the 
like back from Earth. Ever since the Oxy- 
gen Catastrophe, wed been short-changed 
on raw materials, but then you came along, 
with your city-states and language capacity. 
We figured rather than letting you waste your 
potential hitting each other with swords... 
Anyway, now that we have the aerostats, we 
get what we need from comets and asteroids?” 

“Aerostat ...?” 

“Buoyant craft, about a mile wide. We use 
them as construction platforms and launch 
sites” 

“You have cloud cities?” 

“They're automated, except for the main- 
tenance crew — descendants of our original 
human workforce. It's far too cold up there for 
our liking. There's not much up there, now that 
the new resort is up and running. Probably just 
pools of liquid water and some overgrown 
greenhouses, I imagine. I could offer you a 
job up there, but youd probably find it boring” 

“I was thinking something more along 
the lines of interplanetary tourism director. 
If youre going to start up your Earth opera- 
tions again, you need to rethink your image.” 

“We do?” 

Goodman thumped the codex for empha- 
sis. “Absolutely. First, see if your factories can 
whip up a big of pair of pearly gates. m 


S. R. Algernon studied fiction writing 

and biology, among other things, at the 
University of North Carolina at Chapel Hill. 
He currently lives in Singapore. 
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